Journal of 
AMERICAN STATISTICAL 
ASSOCIATION 


JUNE 1957 


Errors of the Third Kind in Statistical Consulting. cS ee A. W. Kimball 


National ‘Income Statistics of Underdeveloped Countries 


Estimating the Service Life of Household Goods by Actuarial Methods 
Jean L, Pennock «nd Carol M. Jaeger 


Control to Photographic Pr 
. Edward and Robert i. 


Maximum Likelihood Estimates for a weak: % Normal Distribution When 
Some Observations Are Missing ...............006.+.+; T. W. Anderson 


The Analysis of Incomplete Block Designs ... 
.M. B. Wilk and Oscar Kempthorne 


Normality Using Fractional Powers of the Variable 


Linear Tranlormation «Set of Stachatcally Dependent Normal Visible 
William J. 


The Regions of Unimodality ard Positivity in the Abbreviated rth and 


BOOK REVIEWS by R. R. er Francis M. Boddy, 
William I. Greenwald, Reginald R. Isaacs, Howard L. Jonvs, Leonard Kent, 
. Jesse W. wlarkham, Francis McIntyre, Jacob Mincer, Robert R. Nathan, John. 
M. Peterson, Herman G. Roseman, Theodore W. Schultz, E. S. nee John Re 


Spellman, I. A. Stegun, D. Teichroew . 265 
- REPORT OF BOARD OF DIRECTORS, 1956 ..................... 


VOLUME 52 NUMBER 278 


133 
 .... Harry T. Oshima 162 
175 
186 
204 
218 
237 
247 
The President’s Economic Repost Russell 257 


American Statistical Association 


Organized November 27, 1839, Incorporated 1841 


The American Statistical Association is a scientific and educational organi- 
zation. Its membership is not confined to professional statisticians but insludes 
economists, business executives, research directors, government officials, uni- 
versity professors, and others who are seriously interested in the application of 
statistice! methods to practical problems, in the development of more useful 
methods, and in the improvement of basic statistical data. Engineers, mathe- 
maticians, biologists, actuaries, sociologiste, psychologists, and representatives of 
many other professions are included in the membership of the Association. 


Annvuat Duss* 
Residents of North America... . 


Student membership Contributing membership 
Introductory membership (first Institutional membership, mini- 
dues payment of members un- 
der 20 years old) 


* Includes subscription to the Journal of the American Statistical Association, and to The An:eri- 


Subscription rate, $8 per year. Prices for back issues available on request. 
For further information about the Association and membership application 
forms, write the Secretary, 1757 K St., N.W., Washington 6, D. C. 


The present Institutional Members are: 


American Management Association International Harvester Com 
American Telephone & International Ladies Garmer: 
Telegraph Worker's Union 
Armour « Company Kennecott Copper Corporation 
Armstrong Cork Company Metropolitan Tre Insurance Company 
The Atlantic Refining Company National Analysts, Inc. 
Bell Telephone Laboratories National Association of eres 
Chrysler Corporation National Bank of Detroi 
The Columbia Gas System, Inc. National Industrial Foeciwitine 
Deere & Company Board, Ine. 
The Dow Chemical Company New York Telephone Company 
Dun and Bradstreet, Inc. Philadelphia Electric Company 
Eastman Kodak Company ’ Schering Corporation 
The Equitable Life Assurance Sears Roebuck & Company 
Socony Mobil Oil Company, Inc. 
Ford Motor Company Standard Oil Company (Indiana) 


zocds Standard Oi] Company of New Jersey 


Humble Oi! and Refining Co. Sun Oil ereny 4 
sters C arehousemen 
& Helpers of America United States Steel Corpcration 
International Business Western Electric Company, Inc. 
Machines Corp. ' Young & Rubicam, Inc. 


ea of address request. 


| 
25 
100 
OF im ol em- | 
| bodied in paragraph 4 section S88, P. L. & R.. authorised ~ 
matter at the Wisconsin. 
address taken from an issue a the periodical should accom: 
‘ 


JOURNAL OF THE AMERICAN 
STATISTICAL ASSOCIATION 


The Editors welcome the submission of manuscripts for possible publication. They 
should be typewritten entirely double-spaced, including footnotes, and two copies should 
be sent to the Editor, W. Allen Wallis, 207 Haskell Hall, University of Chicago, Chicago 
37. Books for review should be sent to the same address. Unsolicited book reviews are not 
accepted, but suggestions of titles for review are welcome. 


EDITOR 


W. Watuis, University of Chicago 
ASSISTANT TO THE Eprror: Dororuy D. 


ASSOCIATE EDITORS 
E1s—ENHART Hastay 
National Bureau of Standards City College of New York 
Currrorp HILDRETH Gerorce M. Kuznets 
Michigan State University University of California 
I. Ricwarp SavacEe 
Stanford University 


ADVISORY PANEL OF FORMER EDITORS 
G. Cocaran (1945-50) Frank A. Ross (1926-34, 41-45) 
Johns Hopkins University Thetford, Vermont 
F. (1920-25) Freperick F. Stepan (1935-40) 
University of Chicago Princeton University 


Corrigenda: Readers and authors are urged to submit tc the Editor notices of errors 
found in this or any previous volume. These will be published once a year, in the 
December issue. 


| 


EXPERIMENTAL DESIGNS, 2nd Ed. 


By William G. Cochran, The Johns Hopkins University, and Gertrude M. 
Cox, University of North Carolina. A favorite among students for the past six 
years, this working manual is now revised and expanded to take into account 
recent developments in the field. Retaining the general framework of the earlier 
edition, it presents plans of all the useful types of experimental designs and shows 
the kind of work for which each design is appropriate. One of the Wiley Pub- 
lications in Statistics, Walter A. Shewhart and S. S. Wilks, Editors. 1957. 
617 pages. Prob. $7.50. 


PRODUCTION FORECASTING, PLANNING, and CONTROL, 2nd Ed. 


By E. H. Mac Niece, Special Representative in Europe, ace 5 & Johnson. 
Reduces the actual methods and techniques of forecasting, planning, and control 
to a set of easily grasped principles. Thoroughly revised in its second edition, it 
includes much new material, and offers a completely up-to-date account of 
modern practices in the field. 1957. 374 pages. Illus. $7.00. 


INTRODUCTION to OPERATIONS RESEARCH 
By C. West Churchman, Russell L. Ackoff and E. Leonard Arnoff; all of 
the Case Institute of Technology. The only introductory text on operations re- 
search, this work emphasizes the method of approaching research from a scientific 
i ge ee and stresses the importance of defining management problems in terms 
of objectives and of administration of O. R. 1957. 645 pages. $10.00. 


MODELS of MAN: Social and Rational 


By Herbert A. Simon, Carnegie Institute of Technology. Sixteen related essays 
develop a coherent theory of the rational and non-rational aspects of human be- 


havior in a social setting. The book proposes a substantially new approach to 

rational decision-making in complex situations and gives the first systematic ex- 

ploration of the rationality of seeking “good enough” courses of action, as con- 

or oat the rationality of maximizing or optimizing. 1957. 287 pages. 
us. $5.00. 


LOCATION and SPACE-ECONOMY: A General Theory Relating to Industrial Location, 
Market Areas, Land Use, Trade, and Urban Structure 
By Walter Isard, University of Pennsylvania. “It is the most important book 
that has been published in this field, and I am confident that it will become the 
basis of much needed instruction. It is remarkable, because few books combi: so 
brilliantly the critical appraisal of existing literature with creative and construc, ve 
synthesis. It is a comprehensive and forward looking presentation of the whole 
on hee thought in this field. . . ."—-Abbott Peyson Usher, Harvard University. A 
Technology Book, M.I.T. 1956. 350 pages. $8.75. 


VECTOR SPACES and MATRICES 
By Robert M. Thrall, University of Michigan; and Leonard Tornheim, Cali- 
fornia Research Corporation. 1957. 318 pages. $6.75. 
AMERICAN FAMILIES 


By Paul Glick, Chief, Social Statistics Branch, U.S. Bureau of the Census. 
Census Monograph. 1957. 240 pages. $6.00. 


80, @. Send for your examination copies today. 


or ruscowne SOHN WILEY & SONS, Inc., 440 Fourth Avenue, New York 16, N.Y. 


Please mention the Journal of the Amenican Sratisticat Association in writing advertisers 


JOURNAL OF THE AMERICAN 
STATISTICAL ASSOCIATION 


52 JuNE, 1957 Numser 278 


ARTICLES 
Errors of the Third Kind in Statistical Consulting. . . . . A. W. Kimpaun 
Samplingina Nutshell. . . . . . . Morris James SLonm™ 
National Income Statistics of . . Harry T. 


= of Goods by Actuarial Methods _.. 
JEAN L. Pennock anv M. JAEGER 


An Applicaton of Multivariate Quality Control to Photographic Processi 5 
J. Epwarp Jackson AND Ropert H. Morris 


‘Likelihood Estimates a Distribution When 
Some Observations Are Missing. . . . .T. W. ANDERSON 


The Analysis of Incomplete Block Designs . . . Marvin ZELEN 
Non-Additivities in a Latin Square Design .M. B. Wux AND Oscar KEMPTHORNE 


.P. G. Moors 


Linear to a Bet of ‘Stochaatially Dependent N Variables. 
J. Moonan 


The ‘Salem of Unimodality and Positivity in the Abbreviated Edgeworth and 
Gram-Charlier Series . . Geratp D. Bernpt 


The President’s Economic 
STATISTICAL ABSTRACTS 


BOOK REVIEWS 


Nerer, JoHN, AND Statistics for Business and 
conomics . . . Francis McIntyre 


Manpet, B. J., Statistics for . . Jacop Mincer 


Vance, LAWRENCE L., AND Statistical Sampling Auditors and Ac- 
couniants . Howarp L. Jones 


AND F., A Primer of Social Statistics . 
. R. BanAaDUR 


Editor, the Third Berkeley on Mathemati- 


Auston §., Principles of Numerical Analysis . . D. TE1cHRoEW 
Levin, Howarp 8., Office Work and Automation . . . . JouN R. SPELLMAN 


Epwarps, FREDERICK, in Market ‘A Selection of Pa 
by British Authors. . . . LEONARD 


AND Eva, Consumer Bspectations, 1953-1956 
. Herman G. Roseman 


.Rospert R. NatHan 


During Cycles . Francis M. Boppy 


in the United States since 1870 
. . THEopors W, 


133 

143 

162 

175 

186 

200 

218 

237 

247 

253 

257 

263 

265 

267 
268 

| 270 
272 
272 

272 

273 

273 

276 

278 

279 

280 

281 


J., Trends in in the Service Industries. 
. JOHN M. Pererson 


Fisner, Wado E., anp M., Price Fixing in the Bitumi- 
nous Coal Industry . . Jesse W. 


Arxrinson, THoMas Re The Pattern of Financial Otonership: Indi- 
viduals, 1949 E. 8. 


RANNELLS, JOHN, The Core of the city: A Pitot Study wi Cha Sind Uses in Cen- 
tral Business Districts . EGINALD R. Isaacs 


PUBLICATIONS RECEIVED 


REPORT OF THE BOARD OF DIRECTORS, AN D SECRETARY-TREASURER 
FOR 1956 . 


282 
283 
285 
286 
287 
290 
292 


1957 OFFICERS, AMERICAN STATISTICAL ASSOCIATION 


President 
R. Leonarp 


Joun W. Tukey 


ALFRED N, Watson 
Joun BoatwrRicHT 


P. BrnpLoss 
Hues H. Brown 
L. Brown 
E. Burcsss 
Irvine Burr 

8. Lez Crump 
Epwarp F. Denison 
Lucite Derrick 
James L. 
R. J. 
Car. L. Exnarpt 
S. M. Freez 


Board of Directors 
President-Elect 
Waurer E. Hoaptey, Jr. 


Vice Presidents 
Martin R. GaInspRUGH 


Directors 
Lester R. FRANKEL 
W. J. 


Secretary-Treasurer 


Donan C, 


Members of the Council 
Irwin FRIEND 
Morris Hampura 
Boyrp HarsHBaRGER 
Paut Homeyver 
T. HORAN 
Wituram H. Kester 
Auuyn W. Kiwpatt, Jr. 
Dupuey 
Rosgrt Borp Lapp 
Joun LEHMAN 
Frep C. Leong 


Past President 
Gertrupe M. Cox 


CuuRCHILL EISENHART 


H. O. Harriey 
Grorrrey Moors 


Joun C. McKzr 
MEIER 
Horace W. Norton 
Roy L. Rererson 
Miuton F. Searu 
Cuar.es SEpGEWICK 
Joun H. 
ARTHUR STEIN 
Miuton Terry 
Henry Tucker 
Lepyrarp R. Tucker 
W. Auten 


Tables and other aids to computation appearing in this Journal are abstracted and 
indexed in Mathematical Tables and Other Aids to Computation. 

Reprints of most articles published since 1945 may be purchased from the Office of 
the Secretary. 


| 


EDITORIAL COLLABORATORS 


Moses Asramovitz, Stanford University 
Frank J. AnscomBeE, Princeton University 
as gs ARKIN, College of the City of New 


Yor 
G. A. Barnarp, Imperial College of Science 
and Tech 
Max A. BersuaD, Bureau of the Census 
Davip H. Buackwe.u, University of Cali- 
fornia (Berkeley) 
C. I. Buss, Connecticut Agricultural Experi- 
D U Ch 
ONALD J. Boaun, University o icago 
Arruour L. Brora, Federal System 
M. Cameron, National Bureau of 
Standards 
Mavis B. Carrow, General Foods Corpora- 


tion 
Herman Cuernorr, Stanford University 
W. 8. Connor, National Bureau of Stand- 


ards 
National Institutes of 
Danie CREAMER, New York Department of 
Health 


Tomas G. Dominion Bureau of 
Statistics 
AcuEson J. Duncan, Johns Hopkins Uni- 


versity 
Davin B. Duncan, University of North 
Carolina 
Merer Dwass, Northwestern University 
Paut 8. Dwyer, University of Michigan 
Henry Army Chemical Corps 
BENJAMIN EpstEIN, State Uniwwersity 
Marx W. California Municipal 
Statistics, Inc. 
Waurer T. Feperer, Cornell University 
Rosert Ferser, University of Illinois 
niversity a Aberdeen 
University of California 
(Berkeley) 
Lzo A. GoopMan, University of Chicago 
Evcene L, Grant, Stanford University 
Emu J. Columbi Universit; 
Joun GuRLAND, Iowa State College 
Dona.p Gurarie, Jr., Stanford University 
Louis Gurrman, Israel Institute of Applied 


Social Research 

Max Hatperin, National Institutes of 
Health 

H. O. Harrier, Jowa State Colle 

Watter C. Jacos, University of Illinois 

Howakrp L. Jonzs, Illinois Bell Telephone 
Company 


Jutes Josxow, City College of New York 

Oscar KemprHorne, Jowa State College 

Joun W. Kenpricx, George Washington 

H. Krusxan, University of Chi- 


cago 

Sotomon George Washington 
University 

E. L. Lenmann, University of California 
(Berkeley) 

W. Arruour Lewis, University of Manches- 
ter 


Gerap J. LizperMan, Stanford University 

National Bureau of 
Standards 

Ciarence D. Lone, Johns Hopkins Uni- 
versity 

Mavow, University of Illinois 

NatHan*’ Manteu, National Institutes of 


Health 
Watrer*P. Merzcer, Columbia University 
James N. Morgan, University of Michigan 
Morris )NEwWMAN, National Bureau of 
Standards 
Harowp NissEe.son, Bureau of the Census 
Gorrrriep E. NoretueEr, Boston University 
Warren Notrer, University of Virginia 
Epwin G. Ops, Carnegie Institute of Tech- 


nology 

Guy H. Orcurt, Harvard University 

Wiuuram R. Passt, Navy Bureau of 
Ordnance 

Hersert Rossins, Columbia University 

Harry V. Roserts, University of Chicago 

W. 8. Roprnson, University of California 
(Los Angeles) 

RosEnBLuUTH, Queen’s University 

L. Scort, University of Califor- 
nia (Berkeley) 

Norman C. Severo, National Bureau of 
Standards 

Dovatas B. Smita, Chase Manhattan Bank 

Hersert Sotomon, Columbia University 

Paut N. American University 

H. C. 8. Txuom, Advisory Committee on 
Weather Control 

Davip F. Votaw, Jr., Yale University 

M. Waker, Columbia University 

Davin L. Watuace, University of Chicago 

B. L. Weuicu, University of Leeds 

Youven, Natiénal Bureau of Stand- 


JOURNAL OF THE AMERICAN 
STATISTICAL ASSOCIATION 


Number 278 JUNE 1957 Volume 58 


ERRORS OF THE THIRD KIND IN STATISTICAL CONSULTING 


A. W. 
Oak Ridge National Laboratory 


Because graduate students in statistics are given little, if any, prep- 
aration for actual consulting, they are prone, particularly in their early 
years, to commit errors of the third kind, many of which could be 
avoided if the students were properly trained. Errors of the third kind 
are defined and are illustrated with actual examples from consulting 
experience. The cases used represent types of error which result from 
different situations that arise frequently in practice. Some discussion 
is included of possible remedies for this problem that are suggested by 
the experience of educators in other fields. 


INTRODUCTION 


T A relatively early age in their graduate academic life, students of statistics 
become familiar with certain risks associated with what they come to 
know as the first and second kinds of error in the theory of testing hypotheses. 
They soon learn that in many widely used statistical tests the first kind of error 
is easy to control but that often the risk of the second kind of error is difficult 
to compute and more often neglected entirely in practice. The importance of 
these errors is constantly brought to their attention through emphasis in their 
course work on such things as uniformly most powerful tests and sequential 
procedures which control the risks of both kinds of error. More recently the 
theory of decision making, the natural sequel to hypothesis testing, has ele- 
vated the notion of risk to an even higher Place i in the hierarchy of ideas passed 
on from professor to student. 

As a result of these teachings many of today’s statistics graduates come 
away from the warm comfort of university complacence into the coldly realistic 
outside world imbued with the idea (and probably rightly so) that the statis- 
tician’s only real function in this world is to compute risks of error for other 
people who have to make decisions. To be sure, there is a vast amount of plan- 
ning (design of experiments, model building) and intermediate adjustment 
(missing data, extreme observations) necessary before the statistician can 
estimate these risks, but essentially this is his main task, and the student finds 
it out usually before the end of the first semester. 

Consider then the embryo statistician who has been released from the uni- 
versity’s uterus with a shiny new degree and who proceeds on his mission as 
- a risk computer fully equipped with the tools of his trade and the mental 
wherewithal to apply them. Let us assume that during the first few ycars of his 
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initiation as a consulting statistician he is lucky, from a mathematical statistics 
point of view, and computes correctly the risks of error for all problems he 
tackles. The chances are, speaking nonmathematically, that during this time 
he will commit the third kind of error more often than he or anyone else realizes. 
What is even more tragic is that, although as a student he was constantly 
reminded of the importance of the first two kinds of error and duly vowed 
always to keep sight of them, he was probably never made aware of the existence 
of a third kind of error, let alone told what to do about it. 

The purpose of this paper is to draw attention to the third kind of error by 
quoting actual examples in which the error was made and later rectified. The 
hope is that the paper will serve simultaneously as a warning and as a moderator 
for newly trained consultants who tend to descend on research workers with 
the sometimes frightening enthusiasm and confidence of a freshman at his first 
football practice, and that perhaps it will help stimulate responsible educators 
to move more rapidly in filling this wide gap in graduate statistics training. 
Most conscientious teachers of statistics recognize this need and are searching 
for effective methods of correcting the situation, but very little real progress 
has been made. 

In this connection there is an interesting analogy between graduate statisti- 
cal training and medical training. The physician of today, after he completes 
internship and residency, is well trained to practice medicine but not so well 
trained to do research. This fact is recognized by many schools in which the 
M.D. who wants to do research in physiology is advised to get a Ph.D. in this 
field after he completes medical school. The emphasis in medical schoo! is on 
practice since most medical graduates never see the inside of a research labora- 
tory. The graduate statistician, on the other hand, is for the most part well 
trained to do research in statistics but more often than not inadequately 
trained to “go into practice,” that is, to do statistical consulting. A safe guess is 
that over half of the graduates in statistics each year are lured into industry or 
government where their principal work is consulting, and those who do go to 
universities frequently find their nonteaching time fully occupied with con- 
sulting both on and off campus. It is of utmost importance, therefore, that the 
third kind of error in statistical consulting be emphasized and brought out into 
the open. Otherwise nothing may ever be done about it. 


THE ERROR OF THE THIRD KIND 


A simple and almost ludicrous definition of the error of the third kind is the 
error committed by giving the right answer to the wrong problem. In defining it this 
way we are allowing the statistician the benefit of the doubt by rejecting the 
possibility that he would give the wrong answer to the wrong question. We are 
also protecting ourselves against the occurrence of a false positive, that is, the 
situation in which the wrong answer to the wrong problem turns out to be the 
right answer to the right problem. At this point the reader who finished the 
introduction without succumbing to the temptation to look ahead for a defini- 
tion may well feel like the reader of a murder mystery who on the last page 
discovers that the victim committed suicide. Why, he may ask, should we con- 
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cern ourselves with any consulting statistician who could be stupid enough to 
commit such an error? Admittedly, there may be many mature statisticians 
who prefer to take this attitude rather than face the consequences of accepting 
its alternative. If this is so, the situation is indeed a grave one. 

There is no way of knowing how many of us, particularly in our early years 
as consultants, were guilty of errors of the third kind, but it is almost certain 
that few have escaped an occasional mistake of this nature. The reason is simple 
enough. Many of us, in good faith, have helped research workers make t-tests, 
or compute analyses of variance, or design experiments thinking we were giving 
the right answer to the right problem; and usually we do give the right answer 
to the question that is asked. Unfortunately it often happens that the question 
asked has little bearing on the real problem, and we are led into committing 
the third kind of error. 

A stranger to the intimacies of statistical consulting might well doubt that 
such ridiculous events could ever occur, but the experienced statistician knows 
that they do occur and will probably never be completely eliminated. Basically, 
errors of the third kind are caused by inadequate communication between the 
consultant and the research worker. In some instances, the research worker is 
at fault for failing to discuss his problem in complete perspective. He may feel 
that the statistician is weak in the subject matter field and that any attempt 
at a complete explanation would be a waste of time; or he may not have his 
ideas completely crystallized and may not want to be “confused” by a mathe- 
matician; or he may know a little statistics and feel that he can state the ques- 
tion adequately himself; or he may simply not want to take up too much of the 
consultant’s time. At the same time the statistician is at fault for not becoming 
sufficiently familiar with the problem to enable him to advise intelligently. 
With proper preparation, sufficient patience, and persistent questioning of the 
experimenter, the consultant should be able to avoid most errors of the third 
kind, but not until he recognizes that they exist. In the next section an attempt 
is made to show that such errors can happen and under circumstances that 
ordinarily would not be regarded as unusual or bizarre. 


EXAMPLES OF ERRORS OF THE THIRD KIND 


The material for these examples is drawn for the most part from the author’s 
own experience, with the natural result that most of the problems come from 
the field of biology. The main theme of the paper, however, is not biological 
and except for weakness in the subject matter field, either on the part of the 
author or the reader, the message should be clear. It should not be inferred 
that the errors illustrated are necessarily those of the author, although he 
would not deny this possibility. 


Example I. An engineer was engaged in particle size determinations in -con- 
nection with corrosion studies. He wanted to estimate the particle size distribu- 
tion, which he was willing to assume normal, but his method prevented him 
from observing particle sizes below a certain diameter. He knew very little 
about statistics but he had heard that there were ways of estimating distribu- 
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tions when samples are restricted. There was no statistician in his own group 
to whom he could turn for help, but there was one nearby who, although very 
busy, might give him a reference. 

So he visited the statistician and presented him with the following sample 
of particle sizes: 25.6, 7.1, 5.1, 4.2, 3.7, 3.0, 2.6, 2.0, 1.8, 1.6, 1.5, 1.4, 1.3, 1.2, 
1.1, 1.0, 0.9, 0.8, 0.7—and pointed out that his method would not allow him to 
determine particle sizes less than 0.7. Assuming the distribution normal, he 
wanted to know how he could estimate its mean and variance. The statistician 
was indeed quite busy and not inclined to spend much time on a problem he 
knew very little about and which did not originate in his group. On the other 
hand he did not want to cause any ill feelings by refusing to give any help at 
all. An easy way out was simply to hand the engineer one of his many reprints 
on truncated normal distributions (after all the engineer had asked for a refer- 
ence), and this he did. Both participants in this short conference went away 
happy, the engineer because he thought he had an answer to his problem and 
the statistician because he disposed of an uninteresting problem in short order. 
But, as any reader who carefully inspected the “sample” of particle sizes al- 
ready knows, an error of the third kind was committed. It might easily have 
gone unnoticed indefinitely, as do many others, but fortunately this error was 
caught. 

The engineer returned to his desk armed confidently with the newly acquired 
reprint and began to apply the method with the help of his 1935 model calcu- 
lator. He had not gotten very far along before he found that one of the statistics 
he computed was far outside the range of a key table given in the reprint to 
facilitate solution of the equations. After checking for and finding no arith- 
metical inaccuracies, he reluctantly returned to the statistician who inwardly 
was not too happy to see the engineer back. This conference lasted longer than 
the first, and with great chagrin the statistician finally realized what a stupid 
blunder had been made. 

Among the methods used in particle size determination is one known as the 
sedimentation method. Briefly, it consists of the preparation of a liquid suspen- 
sion of the material to be analyzed and the measurement of the decrease in 
concentration of particles at or above a particular level in the suspension as 
sedimentation proceeds. Under suitable conditions, Stoke’s law can be used to 
compute the percentage of particles in the suspension having diameters greater 
than d, say, where the value of d is determined by the time elapsed after sedi- 
mentation starts. Thus the random variables are the percentages, and d is a 
fixed or independent variate. It was this technique that the engineer had used. 
The appropriate method of estimation is, of course, probit analysis or one of its 
counterparts, and the “truncation” is not a problem except insofar as it in- 
creases the errors of estimate. 

If the statistician had been familiar with particle size methods, or even if he 
had carefully scrutinized the “sample” that was presented to him, the error 
could never have occurred. It might be argued that both parties to this near- 
fiasco were the victims of circumstance and not really responsible, but if we 
are honest we must admit that the statistician has a duty to be more careful 
in avoiding this kind of error than perhaps any other. If he commits an error 
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of the third kind, he is no less at fault than the physician who inadvertently 
administers arsenic instead of aspirin. 


Example IT. A geneticist working in the field of radiation biology became in- 
terested in the relative biological effects of different kinds of radiation. In one 
experiment he hoped to compare the effects of gamma radiation and neutron 
radiation by exposing two groups of organisms separately to graded doses of 
each kind of radiation and then determining the frequency of mutations at each 
dose. In previous experiments it had been found that mutation frequencies in- 
crease linearly with dose, so he planned to evaluate the relative biological 
effect by a comparison of the two slopes for the two kinds of radiation. 

After the experiment was completed, he visited a newly hooded statistician 
and asked him to estimate the two slopes and make a statistical test of the 
difference between them. He explained that the gamma source used in the 
experiment was radioactive cobalt which provided an essentially pure source of 
gamma rays, but that the neutron experiment was carried out in a cyclotron 
and he had “corrected” the neutron doses for a known gamma ray contamina- 
tion of about 7 per cent. The young statistician, who had little or no experience 
with radiation experiments and who at the moment was not particularly inter- 
ested in learning about radiation, proceeded promptly and, as it turned out, 
rashly with his analysis. From the biologist he had obtained the following data: 


Gamma experiment (i=1,-++,n) 
y;= proportion of mutations 
z,=dose of gamma radiation 


Neutron experiment (j=1,--+,m) 
u;= proportion of mutations 
v;= “corrected” dose of neutron radiation. 


Originally there were several replications at each dose point and the statistician 
had carefully tested for homogeneity. Finding no significant departure from 
binomiality, he pooled the replications and proceeded with a weighted linear 
regression for each experiment. He ended up with the two equations 


bw, 


for the gamma and neutron experiments, respectively. Finally he made the 
requested test of significance and chalked up (he thought) another successfully 
completed problem. 

The third kind of error made by this statistician was most certainly avoid- 
able. He had only to question the geneticist about the nature of the “correction” 
of the neutron dose, and without having to learn much at all about radiation 
dosimetry, he would have discovered his error. The consulting statistician, 
particularly in the physical science and engineering fields, soon learns to ques- 
tion any “corrections” applied by the experimenter before the data are pre- 
sented for analysis. In the problem at hand it turned out that the geneticist had 
simply reduced the original neutron dose by 7 per cent intending thereby to 
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evaluate the effect of neutrons uncontaminated by gamma rays. Overlooked 
was the fact that the corresponding biological effect still included the gamma 
component. When the error was uncovered, a somewhat different approach was 
taken. The two experiments were analyzed simultaneously by minimizing 


> — Ji)? + > — 
j=l 


where 
= a’ + b,'z; 
a; = a’ + b,'(0.07w;) + b,'(0.93w,), 


where the uncorrected neutron doses (w;) were determined from the relation, 
v;=0.93 w;, and where \; and »; are the appropriate weights. Needless to say, 
the second approach yielded estimates and standard errors somewhat different 
from those of the first approach, and the new significance test had to allow for 
the covariance between b,’ and b,’. 

Once again in this example the blame must rest primarily with the statisti- 
cian. Perhaps in his eagerness to apply his newly acquired skills to a problem 
which he thought fell into a pattern he had seen in graduate school, he tempo- 
rarily lost his common sense. Whatever the explanation it is hard to draw any 
conclusion other than one which reflects the fact that he was just not ready to 
do statistical consulting on his own. 


Ezample III. This example illustrates in a sort of general way a situation 
which must occur many times in the life of every consulting statistician. It 
might be called “Consulting by remote control,” or “Communication without 
representation.” Frequently the situation arises in a manner similar to the one 
in this example. 

A research worker who, mostly through experience, had become fairly adept 
with many text-book statistical methods, encountered a problem which was 
new to him and which he could not find in his elementary text-book. He had 
computed two product-moment correlation coefficients and wanted to test the 
hypothesis that the population correlations were equal. He was reasonably sure 
that the t-test would not be appropriate, but he was also sure that some method 
must exist. The research organization to which he belonged did not employ a 
statistician, but he had a statistician friend in the same city who he felt would 
certainly have the answer. For such a minor problem the trip across town was 
hardly worthwhile, but thanks to Alexander Graham Bell, he knew he could 
solve his problem without leaving his desk. The phone call was made and the 
statistician, not wanting to be impolite or difficult by suggesting a meeting in 
person, and being allergic to long telephone conversations, quickly told his 
friend about the z-transformation and where to find an example of its use. 

Sometime later both men happened to attend the same local seminar, and 
upon seeing his friend, the research worker rushed over to thank him for the 
useful advice about the z-transformation. During the course of the conversa- 
tion, the statistician discovered to his horror that the experimenter had taken 
N simultaneous observations on three mutually correlated variables, z, y and 
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z, and the two correlation coefficients which had been the subject of the afore- 
mentioned telephone conversation turned out to be the correlations between x 
and z and between y and z. With much embarrassment he realized that he had 
recommended a t-test between two z-transformed correlation coefficients 
which were not independent. Summing up all his courage he confessed his 
mistake and referred the experimenter to the paper by Hotelling [1 ] in which it is 
shown that under the null hypothesis, p,,.= pys, 


VN — Tye) V1 + 
V2D 
is distributed approximately as “Student’s” ¢ with N—3 degrees of freedom, 
where 


fe tes 
D => 1 Ty: 
Too To 


The experimenter tried to accept the blame for this mistake contending that 
he should have taken the time to explain the actual problem more completely. 
Actually in this error of the third kind it would appear that both parties were at 
fault and for essentially the same reason—neither wanted to take the time to 
find out what the other was really doing. 


Example IV. It seems desirable to include, as one of the examples of errors 
of the third kind, an error of omission. Essentially these errors occur when the 
statistician fails to do the best job possible simply because he has not taken 
enough time to question the research worker thoroughly about his experiment. 
In these cases, the answer given is often the right answer to the right problem 
but not always the best right answer. The following example illustrates an error 
of this kind. 

A geneticist was engaged in a series of recombination experiments with 
bacteriophage T4. He was interested in testing for independence of the occur- 
rence of two markers, r and tu. Under the hypothesis of independence, in an 
experiment in which plaques are counted for all four types of progeny, the ob- 
served and expected plaque counts can be represented as shown in the following 
table: 

PLAQUE COUNT FREQUENCIES 


Type of Progeny 
Frequency Total 
Parental rt tut | r*tut 
Observed a a M 


where p: and pe: are the probabilities of events leading to recombinants 
r+ and tu*, respectively, and q.=1—1, g.=1—p2. Typical experiments of this 
type yield about 90 per cent of parental type progeny and 10 per cent re- 
combinants. 
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The geneticist who was doing these experiments had had some experience 
using chi-square in testing for independence with genetic frequency data, but 
since there were two parameters to be estimated in this case, he was not quite 
sure how to proceed. So he visited a young biometrician and presented him with 
data of the type shown in the above table. After explaining the experiment, he 
mentioned casually that he had much more data from another replication of 
this experiment but that it would probably be of little use since not all of the 
four classes of progeny were counted. 

Perhaps it was too early in the morning, or perhaps the biometrician had his 
mind on something else. In any event he ignored the experimenter’s casual 
remark about the other replication, proceeded to obtain maximum likelihood 
estimates of the parameters p; and p, from the complete experiment and cor- 
rectly computed a chi-square wlth one degree of freedom which provided the 
required test for independence. 

The results of the test were somewhat inconclusive, at least in the mind 
of the experimenter, and he began to reflect on why he had done the second 
replication in the first place. The greatest labor in experiments of this type is 
the counting of plaques, and since about 90 per cent of them represent parental 
type progeny, most of the work is done in counting plaques which provide little 
information about independence. It seemed reasonable to him, therefore, to do 
an experiment in which only the recombinants were counted. This was the 
second replication which he had mentioned to the statistician and it was about 
twice the size of the first. 

With these points in mind he returned to the statistician and asked specifi- 
cally if there wasn’t some way in which the information from the second replica- 
tion could be combined with the first so as to provide a more sensitive test for 
independence. As a result of this gentle prodding by the experimenter, who was 
obviously thinking more clearly than our young biometrician, an approach was 
found which would make use of all the data. The result of the second experi- 
ment was representable as: 


PLAQUE COUNT FREQUENCIES 


Type of Progeny 
Frequency Total 
Parental tu+ r+tu+ 
Observed as as N 
Naip2 Npips 
E ted — N 
(1 —qig2) (1 —qig2) (1 —qige) 


Under the hypothesis of independence the joint probability of both samples is 


M! 


x 


!a¢!a7! 


(1 — pipe)”. 
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The maximum likelihood equations for p, and p2 can be reduced to a quadratic 
equation in p: with only one admissible root, and an equation in p, which is 
linear in pz. A chi-square with three degrees of freedom is then easily computed. 
In this particular experiment the added strength of the second replication was 
sufficient to convince the geneticist that he had no reason to suspect lack of 
independence, whereas the significance level of chi-square based on the first 
replication alone had left him in doubt. 

Perhaps there are only a few young statisticians who would commit an error 
of this kind, but the temptation must be great in many practical situations for 
the new consultant to discard extra observations which make the pattern of an 
experiment look different from what he has been accustomed to seeing in class 
examples. We so often hear it said that many research workers never come to 
the statistician until after the experiment is completed, and that frequently 
much of the data is worthless for statistical analysis. Certainly this does happen 
more often than it should, but in many apparently hopeless cases it also hap- 
pens, as in the foregoing example, that a little extra effort on the part of the 
consultant will yield a workable, relatively simple method of analysis. A feel for 
these situations comes only with experience, but the graduate student should 
be given a chance to get some of this experience before he starts out com- 
pletely on his own. 


A POSSIBLE SOLUTION TO THE PROBLEM 


Many readers may object to the examples which were chosen to illustrate 
errors of the third kind as being unrealistic and unlikely to happen in actual 
practice. To a large extent they are right because all of the errors discussed 
were eventually corrected and hence no longer qualify as errors. But it should 
be obvious that the only errors of the third kind which become known are those 
which are corrected, and for every one which is corrected there must be many 
which we will never know about. If we are ready to admit that these errors are 
committed and perhaps in large numbers, then we should also be ready to do 
something about it. 

The obvious place to start is in graduate schools where degrees in statistics 
are awarded to students who expect to do statistical consulting. For some time 
to come these institutions will provide the largest part of the supply of consult- 
ing statisticians. If the consulting statistician were required by law to obtain a 
license before he could go into practice, we could take our cue from the medical 
profession. Every statistics graduate who expects to consult would be required 
to intern for, say, one year, and at the end of this time would be required to 
take an examination to obtain his license. This arrangement might or might 
not prove satisfactory but most people would admit that it is not practicable, 
at least not in the foreseeable future. 

Let us turn then to the teaching profession. In many states licenses to teach 
are either not required or can be obtained merely by payment of a fee, and the 
teachers colleges, in addition to providing a comprehensive curriculum of course 
work must somehow prepare students for actual teaching. They accomplish 
this by the long established requirement of practice teaching. Every conscien- 
tious teachers college includes as part of its curriculum a period in which the 
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student leaves the campus and under the direction of an experienced teacher 
learns to teach by teaching. In some schools practice teaching begins at the 
junior level, and college administrators have found that there is absolutely no 
substitute for it. Why then should not the statistics student be required to 
learn to consult by consulting? 

Some statistics departments have attempted to achieve this geal by having 
the student “sit in” on consultations held by members of the staff. This un- 
doubtedly helps to some extent, but frequently the student participates very 
little in the discussion and some staff members complain that their clients are 
reluctant to talk in the presence of graduate students. Whereas attendance at 
staff consultations may serve to introduce the student to the complexities of 
consulting, he can never learn to cope with them until he tries it on his own. 
To achieve this opportunity it is imperative that he leave the campus and 
“intern” in the field. 

Exactly how this can best be accomplished is anybody’s guess. As a start it 
would seem that graduate schools should attempt to obtain affiliations with 
consulting groups in government and industry, much as medical schools are 
affiliated with hospitals, or teachers colleges with practice schools. Universities 
contribute heavily to government and industry through the medium of the 
research contract. Both parties benefit, of course, even under the present sys- 
tem, but certainly both would benefit more in the long run if programs of stu- 
dent participation could be arranged. There must be many instances in which 
essentially this sort of arrangement has been made and proved successful, but 
only for an isolated student here and there. To be really effective such a pro- 
gram would have to be made an integral part of the graduate curriculum and 
listed in the catalog as one of the requirements for a degree. 

Those of us in the profession of statistical consulting who take honest pride 
in our work face a real challenge. Two avenues are open to us. One is to ignore 
the presence of this situation and to continue along our narrow paths of indi- 
vidual self-satisfaction, oblivious of the effect it might have on the future of 
our profession. If this course is followed, when the production rate of new 
statisticians begins to catch up with the demand, we will face loss of prestige 
and public confidence, and possibly even virtual extinction. The other avenue 
is to recognize the problem, to appreciate that it is constantly increasing in in- 
tensity and to push hard for positive action as soon as practicable. We should 
have begun yesterday; today we are only thinking about it; tomorrow we 
must act. 
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PREFACE 


uE following article was originally published as a series of eight brief in- 
a in the United States Air Force Statistical Services Technical 
Letter. The Tech Letter is a monthly publication of the Directorate of Sta- 
tistical Services, Hq USAF and is used to disseminate technical information 
on current problems and future plans of all Statistical Services units in the Air 
Force. It is read by an estimated 5,000 Statistical Services officers, key civilians 
and key airmen, world-wide. 

The Statistical Services organization of the United States Air Force is re- 
sponsible for gathering, consolidating, and processing data for virtually all Air 
Force activities. It is easy to envisage the potentialities of scientific sampling 
in so vast an organization, and the Directorate has for the past several years 
explored some of these possibilities. In order to generate greater acceptance of 
this widely misunderstood statistical technique, the author conceived the idea 
of a short series of articles on the subject for publication in the Tech Letter. 
The purpose was not only to instruct and indoctrinate Statistical Services 
personnel but also, through them to help convince Air Force Commanders and 
members of their staffs at all echelons of command that sampling, properly 
conducted, could yield highly satisfactory results with savings in time and 
labor. 

While a few Statistical Services personnel have graduate degrees in statistics 
and a goodly number have had an elementary course or two in statistics, most 
were believed to be unfamiliar with the subject of sampling. In addition, other 
elements of the Air Force at all levels had little familiarity with the subject. 

To make the subject understandable to this audience would, it was believed, 
require a very elementary approach, with avoidance of any technical or mathe- 
matical material. A second requirement was brevity, both because of the 
necessity of avoiding reader fatigue and because the Tech Letter is restricted 
as to size and must cover a wide variety of subjects each month. Furthermore, 
to capture and maintain the interest of this audience it was decided to use a 
light approach, yet avoid, if possible, the impression of flippancy lest the reader 
end by not taking the sampling technique seriously. Upon publication of the 
series of articles it was evident that the readers of the Tech Letter welcomed 
the popularized approach. What was more surprising was that the more sophis- 
ticated readers, statistically speaking, received it equally well. 

The intent of the series was to provide in very brief compass a general survey 
of current sampling procedures. Accordingly, the topics selected were intended 
to cover a rather wide range of the sampling field, in order to serve the purpose 
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of the series. It was the author’s opinion that at a minimum the audience should 
be told the meaning of sampling, its potentialities, the principal types of sample 
design and their differences, the use of varied types of estimating procedures to 
improve sample estimates, and precautions to be observed in sample selection 
and data gathering. In addition, it was believed instructive to mention to this 
Air Force audience the steps taken by the Directorate of Statistical Services 
in organizing for sampling work, including some examples of sample surveys 
completed or under way. 

Following a number of requests, the series was consolidated in the United 
States Air Force pamphlet Sampling in a Nutshell. The favorable response 
to this brief publication by various government agencies, business organiza- 
tions, private individuals and university teachers reflects a much wider interest 
in the subject of sampling than the author had envisioned. More particularly, 
it reflects a considerable demand for presentation of this technical subject at 
the most elementary level. 

It is not contemplated that readers of the Journal will learn anything they 
did not already know about sampling from this article. Nonetheless, it is hoped 
that “Sampling in a Nutshell” will prove of interest and of value to many 
Journal readers. Most statisticians are faced, at one time or another with the 
need of explaining to laymen in their organizations the significance and po- 
tential value of statistical methods. The current article has been effective in 
explaining one area of statistical methodology to an Air Force audience. Those 
statisticians who find it necessary to acquaint other audiences of statistical 
laymen with the subject of sampling may be able to make direct use of the 
article. Others may have the problem of disseminating knowledge of statistical 
methods other than sampling. For these, it is hoped that the methods, devices, 
and general approach will prove of some benefit. 

Two earlier articles on sampling appearing in the Journal represent ap- 
proaches to different audiences. These are the papers on “Principles of Sam- 
pling” by Cochran, Mosteller, and Tukey (March 1954) and “The Applica- 
tions of Sampling Procedures to Business Operations” by Howard L. Jones 
(September 1955). It is evident that neither of these articles was aimed at 
statisticians in the field of sampling. The Cochran-Mosteller-Tukey article was 
intended for social scientists. However, one need not be a social scientist to 
make effective use of this lucid exposition on sampling. The Jones article, while 
directed primarily at telephone company personnel, can prove of service to 
sampling statisticians in many other areas of activity who have the task of 
explaining sampling fundamentals to business men. 


FOREWORD 


HE increased use of sampling in the Air Force during the past few years 
Ti due to the proven value of this statistical tool in a number of practical 
situations. Statistical Services personnel at all echelons should have a basic 
understanding of the elementary principles of sampling. This will lead to greater 
acceptance of a technique which, when properly used, can be of real assistance 
in supplying Air Force Commanders with accurate and timely data at a mini- 
mum cost. 
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“Sampling in a Nutshell” provides an elementary review of sampling that is 
concise yet covers a broad area of this technical field; moreover, it does so in a 
manner that should make it of especial interest to the non-technician. 

First published as a series of articles in the Statistical Services Tech Letter, 
“Sampling in a Nutshell” has been reprinted in its present form as a result of 
interest expressed by a number of Air Force Activities, other government 
agencies and universities. 

L. O. Ryan 

Brigadier General, USAF 
Director of Statistical Services 
DCS/Comptroller 
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CHAPTER I. INTRODUCTION 


VERYONE who has poured a highball into the nearest potted plant after 

taking one sip has had some experience in sampling. The abstemious reader 
doubtless has at one time or other pushed aside a bowl of tepid mush after 
swallowing a spoonful. He, too, has unwittingly employed a sampling technique. 
It is not necessary, one perceives, to have a graduate degree in mathematics 
to become reasonably proficient in sampling matters, in a practical sort of way. 
Naturally, one needs more than a stirring rod or a soupspoon to become a 
sampling expert. However, the treatment of this subject has frequently been at 
too stratospheric a level. Complex formulas and jawbreaking terms have been 
piled high and deep (whence the abbreviation Ph.D.) to the vexation and con- 
fusion of the reader. 

There is a tendency among some statisticians, as there is among some law- 
yers, doctors, economists, and the like to use technical terms to excess. Many 
laymen suspect that this is done to impress or confuse, or perhaps to justify a 
higher fee. Whatever the purpose, this practice is unnecessary in many cases. 
Such statistical terms as “leptokurtic,” “homoscedasticity” and “interpene- 
trating replicate sub-samples” may be very descriptive; however, it is mildly 
deplorable that statisticians have not coined simpler expressions for most of 
these technical terms. Few fishermen, for example, would refer to a “reticulated 
lattice joined at the interstices” ; while highly descriptive, most of them use the 
simpler “net.” 

To be sure, the origins of sampling lie in probability theory. It is true, also, 
that the development of sampling theory requires a rather thorough mathe- 
matical training. Nevertheless, it is believed that the subject can be presented 
in a sufficiently non-technical manner to provide the so-called layman with a 
fair basic understanding of it. “Sampling in a Nutshell” will present the subject 
at the whale-bone or ocean-bottom level. Those whose number sense is suf- 
ficiently keen to enable them to count all the way to ten without removing 
their mittens should be able to manage, while those who wangled a C+ or 
better in high school algebra will find it ridiculously simple (or vice versa). 

Although sampling is constantly gaining greater acceptance in private in- 
dustry and in government, many are still skeptical if not downright suspicious 
of the accuracy of any figures obtained from a partial rather than a complete 
count. In part, this reflects a natural reluctance to accept the evidence pro- 
vided by a small number as correct for the entire group under study. In part, 
it is due to some careless applications of sampling methods in the past. 

Nore: Although tempted, we shall not use the very much overworked example of 
a certain publication that expired following a presidential election poll that turned 


out disastrously. If every writer who has cited that incident as an example of im- 
proper sampling were to contribute one dollar, the magazine might be able to resume 


operations. 


Many are unaware that for certain types of sampling methods the degree of 
precision desired from the sample estimate can be stipulated in advance. More- 
over, it is very interesting to note that the precision of a particular sample of 
such a type can be estimated from the sample itself. In addition, many who 
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insist that the only accurate way is to make a complete count, overlook the 
fact that often there are many sources of error in the basic data and that a 
100% count can be highly erroneous, as well as nearly impossible to achieve. 
In fact, a sample can sometimes yield more accurate results than an attempted 
complete count because the sources of error can be controlled more effectively 
when only a relatively small number of items are to be examined. 

An example of this is an estimate of the value of the inventory on hand for 
some 500,000 consumption-type items. The estimate was derived from a sample 
of some 60,000 items, and was about 8% less than the total value obtained 
from a complete count of the half-million items. However, basic records were 
checked thoroughly for the 60,000 items in the sample and many revisions 
were made. No similar check was made for the remainder of the 500,000 items. 
Consequently, there is more than a slight possibility that the sample estimate 
is closer to the true value than is the total count. 

No doubt about it—sampling is here to stay. Judging from the experience 
of the past decade, the use of sampling will continue to expand at a rapid pace. 
It is already being used to a moderate extent in the Air Force and seems very 
likely to assume much greater importance in Air Force activities in the future. 
Sampling is an extremely useful tool with which all Statistical Services person- 
nel should have at least a nodding acquaintance. 


CHAPTER II. SIMPLE RANDOM SAMPLING 

1. General 

Essentially, sampling consists of obtaining information from a portion of a 
larger group or “universe.” This universe represents the entire lot on which 
some item or items of informatior. are desired. The universe we desire to sample 
may be a large kettle of small fish; ail the apes who have wandered into apiaries 
by mistake in 1954; or the number of USAF troops wearing 15 EEEE shoes. 

The circumstances in a particular problem will determine the sampling 
method or combination of methods to be employed. A variety of methods is 
available and the principal ones will be covered in this and subsequent chapters. 


2. Simple Random Sampling 


A candy manufacturer has a crate of assorted nuts weighing 500 pounds and 
wants to know how many nuts there are to the pound before he daubs on the 
chocolate coating. He could, of course, hire several small boys with heartburn 
to count the mountain of nuts and then divide the total number by 500 to 
obtain the number per pound. Or, he could select a few hundred nuts at random 
from the crate, count them, weigh them and divide the number by the weight 
in pounds. If the nuts were thoroughly mixed before each selection the sample 
would likely yield an acceptable result. The important thing to remember is 
that every nut should have the same chance of being selected in the sample. 
That is the prime requirement for simple random sampling. 

To see how this would operate on a small scale, let us draw a simple random 
sample of 6 nuts out of a universe of 18—the 18 consisting of 3 walnuts, 6 fil- 
berts and 9 peanuts. Let us number the 18 nuts and assign hypothetical 


weights (in milligrams) to each. 
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Walnuts Filberts Peanuts 
Nut Weight Nut Weight Nut Weight 
Number (in Mg.) Number (in Mg.) Number (in Mg.) 
i 55 4 27 10 8 
2 67 5 32 ll 12 
3 43 6 24 12 8 
7 28 13 ll 
8 31 14 7 
a) 26 15 9 
16 7 
17 10 
18 9 
Total 165 168 81 
Average 55 23 9 
Grand Total Weight (18 Nuts)—414 Mg. 
Average Weight — 23 Mg. 


To make a random selection of 6 out of the 18 nuts we shall use the following 
table of random numbers and select the first six different numbers between 


01 and 18. 
List of Random Numbers 
22 57 53 93 
19 48 40 21 
16 61 02 95 
78 36 95 97 
03 18 38 
93 RS 16 04 
78 09 77 61 
23 12 46 85 
16 85 37 21 
38 38 61 15 


Nors: If the reader has occasion to use random numbers, we suggest he resort to published lists rather than 
attempt to pick numbers haphasard’y out of his own head. Lista of random numbers ars not quite that easy to pre- 
pare and must satisfy a number of mathematical tests before they can qualify as truly random. 


Ranpom SamPie 


Nut Number Weight (in Mg.) 
16 7 
3 43 
15 9 
18 9 
9 26 
12 8 
Total 102 
Average 17 (compared with 23 for all 18 nuts) 


This is just one sample out of 18,564 possible different samples of 6 each 
from the universe of 18 nuts. All possible samples of 6 nuts each could be ar- 
ranged systematically in some such manner as the following: 
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1416 6 17 18 21415 16 «17 18 + 


It can be proved mathematically that the average weight of the 18,564 ran- 
dom sample averages amou.ts to 23 Mg., the same as the average of the original 
18 nuts in our universe. 

An undesirable feature of simple random samples is that every so often we 
may select one whose average is pretty far off from the true average. Thus, 
instead of the sample we actually drew (average weight 17 Mg.) we might have 
come up with either one of the following two samples: 


Sample A Sample B 
Nut Number Weight (in Mg.) Nut Number Weight (in Mg.) 

1 55 10 8 
2 67 12 8 
3 43 14 7 
5 32 15 9 
7 28 16 7 
8 31 18 y 

Total 256 Total 48 

Average 423 Average 8 


Clearly, the estimates of the average weight of our universe of 18 nuts as 
obtained from either of these two samples are pretty poor. While the risk of 
selecting such a very poor sample is small, there is available a sampling tech- 
nique that automatically precludes the selection of such lopsided samples. This 
technique, called “Stratified Sampling” will be described in Chapter III. 


CHAPTER III. STRATIFIED SAMPLING 
1. General 
One way of improving the estimate obtained through simple random sam- 
pling in Chapter II would be to arrange our universe of 18 nuts into strata, or 
layers. A fairly obvious method of stratification would be to group the nuts into 
their separate varieties—walnuts, filberts and peanuts. We can then take a 
random sample from each stratum. 


2. Proportional Sampling 


To obtain a sample of six, we would select one of the three walnuts; two of 
the six filberts; and three of the nine peanuts (i.e. one-third of the nuts in each 
stratum). We shall use the short list of random numbers in Chapter II to 
select the first number between 01 and 03; the first two between 04 and 09; 
and the first three between 10 and 18. 
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Stratum Nut Number Weight (In Mg.) 

Walnuts 3 43 
Filberts y 26 
4 27 
Peanuts 16 7 
15 g 
18 9 
Total 121 

Average 203 (Compared with 23 

for all 18 nuts) 


Since each nut had the same chance of selection in our sample (one in three), 
it is unnecessary to weight the separate strata results to obtain the average 
weight of the six nuts. 

One would intuitively expect a better estimate (in general) from the above 
stratified sample than from a simple random sample of six nuts. This is because 
we are certain to have nuts of each variety in our stratified sample, thereby 
avoiding such “wild” samples as six peanuts. In the stratified sampling method 
above, there are 3,780 possible samples of six each, compared with the 18,564 
samples of six each possible in simple random sampling (as noted in Chapter IT), 


3. Equal Size Samples 


This method would entail selection of two walnuts, two filberts and two 
peanuts for our sample of six. Once again we use the list of random numbers 
to select the nuts in each stratum. 


Srratiriep Sampte—“Equat 


Stratum Nut Number Weight (In Mg.) Weighting Factor Weighted Totals 
Walnuts 3 43 
Walnut Total 2 67 _ -- 
_ 110 3/2 165 
Filberts 4 26 
4 27 _ 
Filbert Total _ 53 6/2 159 
Peanuts 16 7 
15 9 
Peanut Total _- 16 9/2 72 
Grand Total 396 
Weighted Average (Total +18) 22 (Compared with 23 for 
our universe) 


Nore: Obviously, if we select } of the walnuts in the universe but only 4 of the filberts and 2/9 of the peanuts 
we must adjust for these varying proportions to obtain an acceptable result. The so-called weighting factor, above, 
is the inverse of the sampling proportion (e.g., we selected } of the walnuts, hence, our factor is 3/2). 
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4. Optimum Allocation 


As the name so cleverly implies (to those of you who have breezed through 
Horace and Ovid) this method affords the best possible subdivision of our total 
sample into individual strata. That is to say, formulas have been developed 
that tell us the size of sample for each stratum that will yield the most precise 
estimates (on the average, that is) for a particular total size of sample. In this 
case, as with equal size samples for each stratum, it is necessary to adjust for 
varying proportions of the sample in the separate strata. 

While the formulas for determining the number of nuts to select in each of 
our three strata are beyond the scope of this series, the “optimum” method in 
this instance, upon solution, tells us to pick all three walnuts, two filberts, and 
one peanut. Using the list of random numbers to select the two filberts and the 
peanut, we obtained a weighted average figure of 214 Mg (vs 23 for our uni- 
verse). The more eager of our readers may find it educational to check this 
figure. 

The optimum allocation method in stratified sampling can be extended to 
situations where the cost of conducting the sample survey varies from stratum 
to stratum. Thus, given a fixed budget for the project, the most efficient way 
of subdividing the total sample by strata can be determined; or, if a certain 
degree of accuracy is stipulated in advance, the size of sample and its break- 
down by strata can be determined so as to yield the lowest possible cost that 
will achieve the desired accuracy. 


CHAPTER IV. CLUSTER SAMPLING 


Another way of selecting a sample of six nuts from the 18 in our lilliputian 
universe would be to group the 18 into non-homogeneous clusters or piles. We 
can do this by making up three clusters, each containing one walnut, two 
filberts and three peanuts. 


Cluster I Cluster IT Cluster III 
Nut Weight Nut Weight Nut Weight 
Number Variety (in Mg.) | Number Variety (in Mg.) | Number Variety (in Mg.) 
1 Walnut 55 2 Walnut 67 3 Walnut 43 
4 Filbert 27 6 Filbert 24 2 Filbert 31 
5 Filbert 32 7 Filbert 28 9 Filbert 26 
10 Peanut 8 13 Peanut il 16 Feanut 7 
ll Peanut 12 14 Peanut 7 17 Peanut 10 
12 Peanut 8 15 Peanut a 18 Peanut a 
Total 142 146 126 
Average 234 244 21 


Again using the short list of random numbers (which was printed in Chapter 
II), the first number between 01 and 03 turns out to be 03, hence Cluster III is 
selected for our sample. The average weight of the nuts in this sample is 21 Mg, 
compared with the Universe average of 23. 

In general, the most precise results under cluster sampling will be obtained 


3 
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when each cluster contains as varied a mixture as possible, while one cluster 
is as nearly alike another as possible. The reason our cluster sample estimate 
above was so good is that the nuts in each cluster were “all mixed up” while 
each of the three clusters resembled the others closely (each had the same num- 
ber of walnuts, filberts and peanuts). 

We noted in Chapter III that the criteria for effective stratified sampling 
are just the reverse—strata as homogeneous as possible internally with the 
strata differing from one another as much as possible. Our example met these 
criteria nicely; each stratum comprised a separate variety of nuts, and the 
three varieties were quite different from one another (with regard to the average 
weights). 

While stratification, if performed with a modicum of judgment will almost 
always yield a more precise estimate than a simple random sample, cluster 
sampling generally gives a less precise estimate than a simple random sample 
of the same size. The reason is that in most practical situations one must accept 
the clusters as they exist and often each is relatively homogeneous, while 
differing from all other clusters. As a matter of fact, the use of cluster sampling 
is generally dictated by cost and administrative considerations. 

Suppose, for example, that the Double-Jointed Peanut Company wanted to 
know the average annual consumption of that delicacy per household in Wash- 
ington, D. C. If a list of all households in the city were available it could select 
a sample of, say 2,000 households at random and obtain the average consump- 
tion figure from this sample. It would be much simpler and cheaper, however, 
to select at random some 100 city blocks averaging 40 households each, and 
enumerate these blocks completely. 

In general, the households in a single block are fairly homogeneous with re- 
spect to income (compared with a block in another part of town) and the block 
residents may well be similar in their performance as goober gourmands. Thus, 
our clusters would be homogeneous. Nevertheless, the sample of 100 blocks 
containing 4,000 households might readily provide as accurate a result as the 
widely dispersed random sample of 2,000 households, and at a lower total cost. 
While the random sample is more precise, size for size in the above example, 
the larger cluster sample is more efficient, dollar for dollar. 

In actual practice sampling entire clusters is not too common. More fre- 
quently the sample is selected in two or more steps or stages. In two-stage 
sampling, clusters are selected at random and a subsample is drawn at random 
from each cluster. The clusters in such a sample design are termed primary 
sampling units (or p.s.u’s) while the subsample elements are called elementary 
units. As an example, we might select 100 one-pound bags of assorted nuts 
from a universe of 1,000 such bags and then select 25 nuts from each of the 
100 bags. The bags or clusters are the p.s.u’s and the nuts are the elementary 
units. 

Cluster sampling is often feasible in preparing sample estimates from data 
contained in a large volume of punch cards. If the cards are stored in a number 
of drawers, each drawer can be considered a cluster. Then several drawers can 
be picked at random and either all or a designated fraction of the cards from 
each drawer can be selected for the final sample. 


* 


SAMPLING IN A NUTSHELL 
CHAPTER V. OTHER SAMPLING METHODS 
1. General 


Our surviving readers will doubtless be gratified (and relieved) to learn that 
the previous chapters have already covered most of the important types of 
sampling—simple random, stratified and cluster. There are other sampling 
methods, however, and every self-respecting, well-rounded Statistical Services 
type should have at least a superficial knowledge of these. The following treat- 
ment, it is hoped, will prove sufficiently superficial for most purposes. 


2. Systematic Sampling 

Returning to our 18-nut universe once again, we could select a sample of six 
nuts in the following way. First, select at random a number between one and 
three. Let us say the number selected is two. We would then include nut num- 
ber 02 and every third nut thereafter in our sample. Thus, our sample would 
consist of nuts # 02, 05, 08, 11, 14 and 17. This method is called systematic 
sampling and is used frequently because it is simple, direct and inexpensive. 
When a list of names or items is available systematic sampling is often an 
efficient approach. In selecting a sample of punch cards from a number of 
drawers full of cards a ruler is sometimes used to select one card per inch, say. 
The sample so obtained may be treated as a systematic sample for practical 
purposes. Systematic sampling is also referred to as patterned or serial sam- 
pling. 
3. Double Sampling 


A fairly common sampling technique is to select a large sample of items or 
persons in a rapid, inexpensive manner and use the information from this large 
sample as a basis for selection of a small sample for more intensive study. For 
example, a simple, brief questionnaire could be used to obtain data on the oc- 
cupational distribution of a large population from a sample of 10,000 persons. 
A very comprehensive questionnaire could then be sent to a sample of 1,000 of 
these 10,000 persons, using the proportions of each occupational group in the 
larger sample for selecting the sample of 1,000. This type of sampling is called 
double sampling. One foreign sampling expert has found the budgetary impli- 
cation of the term “double sampling” to be a drawback in seeking funds for 
such sampling projects. He has therefore used the term “half-sampling” with 
some success. Since the final sample used to obtain the desired information is 
generally much smaller than the first, large, superficial sample there is ample 
justification for this fiscal euphemism. Double sampling is also referred to as 
two-phased sampling. 


4. Interpenetrating Replicate Subsamples 

As noted in Chapter I, we decry the use of such sesquipedalian terms as the 
above as much as the reader decries the use of the word sesquipedalian. In our 
opinion, the term “independent subsamples” would be adequate to describe 
this type of sampling. 

Interpenetrating replicate subsamples, to use the professional designation, 
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are useful in measuring the degree of agreement of the results of successive 
samples. They are also useful in obtaining crash estimates. One subsample can 
be used for this purpose prior to calculation of a final estimate from the entire 
sample. A further use is to. check one enumerator against another by giving 
each a separate subsample to work on. As an illustration of an i.r.s., suppose 
we had 1,000 one-pound bags of walnuts numbered 1 to 1,000, consecutively. 
We might select 10 different samples of 10 bags each at random and select 
10 walnuts from each bag at random. If we put the 100 walnuts from the first 
subsample of 10 bags into the Ist of 10 baskets, put the 100 walnuts from the 
second subsample of 10 bags into the 2nd of the 10 baskets, etc., the 10 baskets 
of 100 walnuts each would represent 10 interpenetrating replicate sub-samples. 


5. Sequential Analysis 

One important method of acceptance sampling in quality control work should 
be noted briefly. This is sequential analysis, and was developed around 1945. 
It differs from other types of sampling in that successive units are examined 
and the decision to accept an entire lot, reject it, or examine the next unit is 
made after each successive unit is examined. This method has proved very 
efficient and economical in verification of punched IBM cards, for example. 
High levels of accuracy are maintained even though only a very small propor- 
tion of the cards are verified. 


6. Combined Sampling Methods 


In most sample surveys a combination of sampling methods rather than one 
method is used. ; 

For example, an Air Force sample survey might entail setting up a number 
of strata (major air commands) each of which includes several clusters (sub- 
commands); selecting clusters from each stratum with optimum allocation; 
drawing a proportional sample of segments (wings) from each cluster; selecting 
a random sample of sub-units (squadrons) from each segment; and drawing a 
systematic sample of elementary units (aircraft) from each sub-unit. 


CHAPTER VI. ESTIMATING PROCEDURES 
1. General 


The truly avid reader should have acquired by this time a thorough mastery 
of the assorted sampling methods covered in previous chapters. One would 
normally expect that this knowledge would be sufficient to insure sample esti- 
mates par excellence. However, in order to keep ribbon clerks from overrunning 
the profession, statisticians have artfully devised a bewildering array of esti- 
mating procedures. These are designed to yield better estimates from the same 
basic sample data, and (surprisingly) often do exactly that. Although some 
Stat Services experts have concocted their own original estimating techniques 
that are guaranteed to get accurate answers from inaccurate data the following 
paragraphs will be limited to a discussion of some of the less original types of 
estimating procedures. 
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Suppose we wanted to estimate total hours flown by the ten Z-99 squadrons 
in Baker Command in August. Suppose, further, that we had decided to make 
our estimate from a simple random sample of three squadrons. Hypothetical 
figures for both July and August for all ten squadrons are listed below in order 
to illustrate the results of the several estimating procedures. 


Hours Flown 


Sample Total 


2. Blowup Estimate 


The most obvious estimate of total flying time for August ‘s the blowup, or 
simple unbiased estimate. This is obtained by applying the reciprocal of the 
sampling ratio to the August sample total. Since we selected 3/10 of all the 
squadrons for our sample, our blowup estimate is 10/3 X2,688 = 8,960 hours. 
By comparison, total flying time for August, in this illustration, was 10,000 
hours. 


3. Ratio Estimate 


One very useful method of estimation that often yields more precise results 
than the blowup estimate is the ratio estimate. If we have available the flying 
time data for all squadrons for the previous month, the ratio estimate can be 
used and is obtained as follows: 


Ratio Estimate Flying Time for Sample Squadrons, August Flying Time 


of Total Flying= for All Squad- 
Time, August Flying Time for Sample Squadrons, July rons, July 


Squadron No. 
July August 
1,179 1,227 
714 885 
1,076 1,145 
763 843 
874 913 
1,191 1,284 
483 651 | 
745 890 
1,272 1,370 
703 792 
9,000 10,000 
Our random sample consists of Squadrons 2, 5, and 8. 
Hours Flown 
Squadron No. 
July August 
2 714 885 
6 874 913 
745 890 
2,333 2,688 
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2,688 
In the present instance, our ratio estimate = 2333 X 9,000 = 10,369 hours. 


? 
4. Difference Estimate 


Still another method of estimation is the difference estimate, one form of 
which is as follows: 


Difference Estimate Blowup Estimate Flying Time Blowup Esti- 

of Total Flying =of Total Flying + | for all Squad- — mate of Total 

Time, August Time, August rons, July Flying Time, 
July 


In our example, the difference estimate = 8,960 + (9,000 —7,777) = 10,183 hrs. 


5. Linear Regression Estimate 
The linear regression estimate is similar to the difference estimate described 
in the previous paragraph, except that the term in parentheses is multiplied 
by “b,” where “b” is the common or garden variety of regression coefficient 
. encountered in simple linear correlation (in the estimating equation y=a+bz).! 
Thus, the equation for the linear regression estimate is X’’= X’+b(Y—Y’) 
where: 


X” is the linear regression estimate for August 
X’ is the blowup estimate for August 

b is the regression coefficient 

Y is the actual total for July 

Y’ is the blowup estimate for July 


In our example, the linear regression estimate, 
X"’ =8,960+.8959(9,000 — 7,777) = 10,056 hours.’ 


6. Summary 

We have seen that it is necessary not only to seek out the most efficient 
sampling method, but to select an effective estimating procedure in a specific 
sampling problem. In our simple illustration the following results were ob- 
tained: 


1 Readers who were under the impression that correlation is 2 type of square dance are referred to the nearest 
statistics text. 
1 For the skeptical (and assiduous) reader who wishes to check this answer, 


- Hin - 


my; - 
Y; is flying time for * squadron, July. 
Xj is fiying time for ¢ squadron, August. 
Y is average flying time, all squadrons, July. 
X is average flying time, all squadrons, August. 
= indicates that the products (or squares) are to be summed over all 10 squadrons. 
Nore: Normally, 6 would be computed from the sample data for the two successive months, since data for all 
squadrons would not be available for the later month. 
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Hours Flown % Error 
Actual Total for August 10,000 
Blowup Estimate 8,960 10.40 
Ratio Estimate 10,369 3.69 
Difference Estimate 10,183 1.83 
Linear Regression Estimate 10,056 0.56 


When data for successive periods (or fo separate items) are correlated, the 
ratio estimate will often yield some impr ement over the blowup estimate. 
The linear regression and difference estimates will also usually yield better 
results under such circumstances, although the computations, particularly for 
the regression estimate are more laborious. 


CHAPTER VII. SAMPLING AND NON-SAMPLING ERRORS 
1. General 


The very select group consisting of our readers is doubtless convinced by 
now that while sampling is not necessarily fun, it is frequently feasible. Others, 
with less erudition and acumen are loathe to accept any results that haven’t 
been obtained from a complete count. The latter mutter querulously about 
sampling errors and sampling biases, but are unmindful of certain facts, to-wit: 

a. When sampling is performed so that every tnit in the statistical universe 
has a chance of being selected and the probability of selection is known, the 
errors of sampling can be controlled satisfactorily. Sampling in this manner is 
known as probability sampling. 

b. It is true that there are a number of non-probability sampling methods 
in which the sampling error cannot be controlled. However, competent sampling 
technicians avoid this type of sampling except in special situations, and only 
when the effects of using data with unknown sampling errors and biases are 
not important. 

c. Any data collection and processing system is subject to a variety of 
errors, and the non-sampling errors in such a system often are of greater mag- 
nitude than the sampling errors. Moreover, the degree of error of a sample 
estimate can be measured (for probability samples) by means of existing sta- 
tistical theory. Non-sampling errors on the other hand are generally not meas- 
urable (except by use of sample checks). It is true that sampling errors can be 
avoided by making a complete enumeration. However, overall accuracy can 
often be increased by conducting a sample survey instead of a complete count. 
The smaller scope of the sample project makes it possible to be more selective 
in assignment of personnel for the survey operations; to be more thorough in 
their training; and to be able to concentrate to a much greater degree on re- 
duction of the non-sampling errors. The net effect of a sample survey (as com- 
pared with a 100% enumeration) is often a more accurate answer, achieved 
with fewer personnel and less paper work, at a lower cost, in a shorter time. 

The following list, by no means complete, will afford some idea of the more 
important types of errors to be concerned about when gathering data. 


158 AMERICAN STATISTICAL ASSOCIATION JOURNAL, JUNE 1957 


2. Reporting and Processing Errors 


It is a well-known statistical paradox that the average age of women over 
forty is under forty (as reported by the ladies). For equally esoteric reasons a 
kumquat picker may report his occupation as citrus grower, while the man 
who sews up the buttonholes on jacket lapels avers he is a couturier. One per- 
son may even report different figures for the same item (e.g. his age as listed 
on a life insurance application and as reported to his draft board). Basic data 
may be entered incorrectly, copied erroneously, coded improperly, manipulated 
ineptly or omitted completely. A recent Statistical Services project at this 
Headquarters entailed collection of flying time and fuel issue data from basic 
records. These data revealed that some aircraft had flown 20, 30 or even more 
hours without a drop of fuel. Unfortunately, this feat was accomplished not by 
using a new and revolutionary carburetor, but by omitting fuel issue entries 
in the DD Form 781-2. 


3. Non-Response Errors 

Very serious biases may arise from using the data reported only by respond- 
ents to a questionnaire or the “at-homes” in a household survey, whether it 
apply to a sample or the entire universe. A daytime house-to-house survey 
conducted during the week would undoubtedly yield a very poor estimate of the 
number of lady plumbers. An evening follow-up of the not-at-homes would be 
necessary to overcome this obvious bias. It is always dangerous to assume that 
the non-respondents or absentees possess the same characteristics, and to the 
same degree, as the respondents and at-homes without a sample check to assure 
the validity of this assumption. 


4. Errors in Sample Selection 


An all too common source of error is the improper and unscientific seleetion 
of sample units. “Chunk” samples selected for convenience are in this category. 
The enumerator who picks Joe’s Bar and Grill as the place to conduct a survey 
on temperance might be cited as a perpetrator of slightly biased “chunk” 
sampling. “Judgment” samples, though sometimes yielding good results, are 
risky for the very reason that even the judgment of experts varies considerably 
in the selection of representative samples. “Quota” samples, too, possess the 
defect of depending on the inclination of the enumerator rather than on a 
random selection. Such a sample might specify, for example, that the enumera- 
tor obtain data from 50 males, age 35-40, with incomes from $5,000 to $7,000. 
While this restricts him to some degree from making a thoroughly unrepresenta- 
tive selection he may elect to conduct his interviews during a double-header at 
Yankee Stadium. 


5. Sampling Errors 


The result of a sample survey will hardly ever agree completely with that 
which would have been obtained from a complete enumeration carried out in 
the same manner. The difference between the two is the sampling error. If a 
probability sample design is used, it is possible to predetermine the size of 
sample needed to obtain a specified degree of precision. This most important 


4 
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and most felicitous property of probability sampling usually makes it possible 
to keep the sampling error within the desired limits. 


CHAPTER VIII. SAMPLING AT STAT SERVICES, HQ USAF 
1. Introduction 


The earlier chapters of this series have listed some advantages of sampling 
and have presented a simplified exposition of sampling methods. This chapter 
outlines the organization of the sampling function in the Directorate of Sta- 
tistical Services, Hq USAF and describes some sampling projects underway in 
the Directorate. 


2. Sampling Organizations in the Directorate 


a. Sampling Branch—Personnel Statistics Division.—This unit has the re- 
sponsibility of planning and operating personnel statistical systems which are 
primarily based upon psychological and sampling techniques. Its major re- 
curring operation at present is the quarterly Sample survey of Air Force Mili- 
tary Personnel. 

b. Assistant for Sampling—Plans and Liaison Group.—The function of this 
unit is to provide technical assistance on sampling and other statistical method- 
ology to all divisions of the Directorate, to other Air Staff agencies at Hq 
USAF, and to Statistical Services units of Major Air Commands. 


3. Some Typical Sampling Projects 

a. Quarterly Sample Survey of Military Personnel——This survey was in 
operation before the Air Force became a separate establishment but has under- 
gone considerable growth and development since that time. A wide variety of 
information covering characteristics, opinions and intentions of Air Force mili- 
tary personnel is collected in the survey. This information provides guidance 
for personnel policies and actions that have considerable impact on virtually 
all Air Force activities. Data from the survey find their way into nearly every 
major personnel report in Hq USAF. 

b. Sample Survey of Air Force Clothing Sales Stores.—This survey was initi- 
ated at the request of DCS/Materiel, Hq USAF’. Imbalances had developed in 
the inventories of some replacement clothing items because of inadequate 
“tariffs” or size distributions of these items. In addition, it was believed that 
stockage of the stores according to sales volume and climatic location required 
some attention. 

On the basis of pre-test data it was determined that a sample of 35 out of 
more than 200 stores world-wide would yield acceptably reliable clothing size 
distributions. Sales stores were stratified by size of store within climatic regions 
and samples were selected from each regional listing. 

The survey is expected to yield: accurate size distributions for principal 
clothing items; the distribution of sales by item by climatic region; and the 
distribution of sales by item, by size of store. It is believed this information 
will help achieve improved purchasing and stockage procedures; smaller inven- 
tories of both raw materials and end items; higher rates of inventory turnover; 
and better service to Air Force personnel. 
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c. Sample Reporting of Air Force Activities—Data on a very wide variety of 
activities are necessary to run the largest business in the world—the United 
States Air Force. The Director of Statistical Services is presently engaged in a 
continuing project to discover whether it is possible to substitute sample re- 
porting for complete reporting in some areas of activity without loss of neces- 
sary detail or desired accuracy in the results. Preliminary research into this 
area was begun recently, using data from one major command (MATS) on 
aircraft, flying time, hours in commission, engine removals, etc. 

In brief, it may be possible to obtain reliable monthly data for specific items 
in the following way: (1) prepare a complete report covering all USAF units 
once a year in order to set up a checkpoint or benchmark; (2) sample a rela- 
tively small number of units during each of the eleven succeeding months- 
(3) link the monthly sample estimates to the benchmark from month to month, 
with a complete enumeration providing a new benchmark in the 12th month. 

Data from a second major command (ConAC) have been received and other 
commands will be contacted in ensuing months. Feasibility of sample reporting 
will be tested for a number of Air Force reports on a continuing basis. Results 
of research into this project will be announced to the field through the Stat 
Tech Letter. 

d. Sample Survey of Fuel Consumption Rates.—This survey was established 
in January 1955 to develop more accurate fuel consumption rates (by type and 
model of aircraft) than were currently available. Existing rates were found to 
be understated because of failure in some instances to enter fuel issue data on 
the DD Form 781-2. At the outset approximately two-thirds of all aircraft 
types and models were excluded from the survey since they accounted for less 
than two per cent of USAF-wide fuel consumption. Organizations were sampled 
for each basic type and model. In organizations where more than one squadron 
flew the same types of missions in the same type and model aircraft, only one 
squadron was selected for inclusion in the survey. These methods made it 
possible to reduce the workload at all echelons, including the job of machine 
tabulation and analysis at this Headquarters. 

The existence of a stock fund accounting report reflecting total fuel costs 
made possible an overall check of the fuel consumption rates developed as a 
result of the sample survey. Total costs estimated by applying the sample rates 
were found to be over 99 per cent accurate. The sample survey fuel consump- 
tion rates have been published in the Peacetime Planning Factors Manual. 
In addition, reporting of fuel issues en the 110-A report has been eliminated 
as a result of the sample survey. It is estimated that some 100,000 manhours 
will be saved annually because of this lessened reporting requirement. 


CHAPTER IX. SUMMARY 


Sampling is a valuable tool for obtaining data quickly, cheaply and accu- 
rately. Scientific application of sampling principles by government agencies, 
universities and private companies can do much to eliminate mistrust of sample 
results occasioned by improper sampling. 

A variety of sampling methods and estimating techniques afford the user a 
wide selection for specific situations. He can choose the method or combination 
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of methods that will yield a desired degree of precision at a minimum cost; or, 
if given a fixed budget for a sample survey, he can make his selection in a way 
that will yield the maximum precision for a fixed cost. 

The Statistical Services Directorate, United States Air Force is presently 
engaged in a number of sampling projects and plans to expand its use of this 
technique in the future. To conduct the vast business of the Air Force effec- 
tively requires data from a wide variety of activities. The sampling projects 
of the Air Force represent one step in its program to reduce the volume of paper- 
work without reducing the quality or timeliness of reported data. 


NATIONAL INCOME STATISTICS OF 
UNDERDEVELOPED COUNTRIES 


Harry T. 
Stanford University 


INTRODUCTION 


HE postwar years have seen an impressive accumulation of statistics of 

national income and expenditure in the less developed countries. Some 50 
countries in Asia, the Middle East, Africa, and Latin America now compute 
such statistics on a regular basis. Despite enormous progress, various weak- 
nesses in these statistics still exist. Since these statistics are being increasingly 
used for various purposes, it may be of interest to describe their shortcomings. 
A brief paper on such a broad topic can serve as little more than a general 
introduction, but its purpose will be fulfilled if it conveys an idea of the ap- 
proximate character of the estimates.! 

In fairness, it must be stated that the shortcomings discussed below should 
not be attributed to the national income statisticians responsible for the re- 
spective estimates. Their achievements have been considerable, considering the 
small staff and inadequate statistical material with which they worked. More- 
over, the structure of underdeveloped countries presents almost insuperable 
difficuities. These are the widespread prevalence of small units of production 
whether in agriculture, industry, trade, transport, or services; the lack of 
specialization and the variety of activities undertaken by these units of pro- 
duction even though small; the absence of accounting records and even uniform 
units of measurement; the variation in quality, size, and design of a particular 
commodity whose production is not subjected to the standardization imposed 
by the machine process and scientific agriculture; the importance of subsistence 
production, etc.? 

In speaking of Asia and Latin America, Japan, Puerto Rico, Argentina, and 
Chile are excluded from the discussion below. Excluded from sections 1, 2, and 3 
are African countries. The statistics of the latter countries outside the European 
sectors may be said to be of considerably lower quality and coverage.’ In the 
appendix, some of the estimates of African countries are briefly noted. 

1 The sources of these data are given in the United Nations’ Statistics of National Income and Expenditure, 
Statistical Papers, Series H, No. 9. This publication presents semi-annually summaries of current figures for nearly 
60 countries, Another publication of the United Nations, Methods of National Income Estimation, Series F, No. 8, 
gives convenient, though brief, descriptions of methods and sources used in various countries. Also see appendix 
notes at the end of this paper. 

2 A more detailed discussion of these obstacles is attempted in the writer's paper “The price system and national 
income and product,” in Review of Economics and Statistics, XX XIII (1951), 248-54. 

* See Phyllis Deane, Colonial Social Accounting, Cambridge, England, 1953, and a review of the book by 
William O. Jones in the Journal of the American Statistical Association, 50 (1955), 665-76. The African estimates are 
excluded from the discussion on the following grounds: Since the national market (embracing both the native and 
the European sectors) for factors and for products is probably not sufficiently evolved in any of the African coun- 
tries, the use of local factor-prices and product-prices in different regions of a given country to aggregate incomes, 
output, and expenditures does not give a meaningful nationai total, even if a single unit of currency prevailed. In a 
country like the United States, the existence of a national market (i.e., extensive regional mobility of factors and 
products) sets up strong tendencies toward the equalization of annual real earnings of each factor. The dollar 
everywhere is a tolerably adequate measure of productivity per unit of input. In the absence of a national market, 
the opposite is the case. It may be partly for this reason that the prevailing national income statistics for African 


countries give such strange results (see Appendix). This point is not only relevant for Africa but perhaps also for 
certain regions of large countries in other areas (e.g., Breil, India, Peru, etc.). 
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1. STATISTICS OF TOTAL OUTPUT 


With few exceptions, the total, whether national income at factor cost or 
gross national product at market prices, is arrived at by the addition of net 
value added.‘ In most countries, net value added is obtained from production 
statistics for agriculture and a few commodity-producing sectors. For handi- 
crafts and tertiary industries, distributive shares applied to labor force data 
are used. This is the usual method of computation, although in a number of 
Latin American countries (Cuba, Colombia, Peru, etc.) distributive shares are 
more extensively used to estimate income originating than agricultural output 
statistics. 


Output Derived from Physical Productien Data 


Under this method, output is multiplied by unit prices and inter-sector ex- 
penses are deducted. Each of these three components will be discussed sepa- 
rately. This method is responsible for the derivation of the major mare of total 
output of underdeveloped countries. 

Total physical output in agriculture for the major crops is derived in turn 
by multiplying acreage for each crop by average yields, except for years in 
which agricultural census data exist. In Asian countries, these output data for 
the major crops have been collected traditionally for tax administration. Com- 
pared to data of other underdeveloped countries, they are collected systemati- 
cally and constitute the strong points of Asian estimates. This is generally not 
the case for most Latin American countries, which must depend on decennial 
censuses of agriculture. The Asian data for total acreage vary in coverage and 
quality from country to country. For India, some 20 per cent of the localities 
do not report their figures, while for Burma and Ceylon the coverage appears 
to be better. Average yield statistics are probably subject to a certain under- 
statement like all tax statistics and depend very much on the skill and objec- 
tivity of the agent making the estimates. In most Asian countries, village 
officials survey a certain portion of the crops on the field before harvest and 
their estimates are entered as typical yields. These officials have the advantage 
of experience, but since their estimates are rarely compared with the actual 
harvested crops, mistakes in judgment are difficult to correct.5 

The data on the minor crops of most countries (both acreage and yields) are 
based on shakier ground. Frequently they are the judgment of agricultural 
officials of the government, who base their estimates on some past agricultural 
census, current experience, etc. In some of the countries, information on live- 
stock is collected from time to time and to the total in each category some yield 
estimate is applied to obtain the products of livestock. For the smaller live- 
stock such as chickens and for a significant part of forestry and fishery, firm 
data rarely exist and rough estimates are made in consultation with appropriate 


experts. 


4 The exceptions are mainly small countries tike the Dominican Republic, Ceylon, and Malaya—countries 
where import statistics furnish information on a large proportion of national expenditures (one-fourth or one-third). 
In India, the Indian Council of Agricultural Research was asked by the government to undertake crop- 
cutting surveys on a random sample basis as a means of checking the accuracy of yield estimates. In general, the 
latter were found to be understated. 
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Price statistics for the major crops also are considerably better than for the 
minor crops, livestock products, forestry, and fishing. The chief problem posed 
by major crops’ prices is the extent of regional coverage. In most cases, they 
do not appear to comprise prices at which villagers sell their crops but prices 
in the major wholesale markets, usually in the nearby towns or cities. Since a 
large part of the output is actually sold or consumed in the village, the use of 
town prices overstates the value of output. Offsetting this tendency is the fact 
that the prices used for valuation of the total crop are usually those collected 
at harvest time when the excess of supply over demand is normally greatest 
and prices lowest. About one-third to one-half of the crops are retained and 
consumed by the farm families throughout the year. The proper valuation of 
retained crops is closer to an annual average which is considerably higher than 
harvest prices, judging from available scattered evidence.® 

Outside of a few commercial crops (sugar, coffee, tobacco, fruits, etc., in 
Latin American countries, e.g., in Cuba) appropriate statistics on production 
expenses are not available to the national income estimators. Where income 
taxes cover the larger farms, tax data give some notion of production coef- 
ficients in some of the crops. Lacking systematic investigation on production 
costs, the .ational income estimator must rely on scattered local surveys, im- 
port data (e.g., imports of fertilizers), expert opinion, and past censuses.’ 
Though gaps in cost statistics are considerable, in the aggregate they do not 
usually comprise more than one-fourth of the gross value of agricultural pro- 
duction. The over-all errors introduced by their deficiencies cannot be too 
large. 

The general impression of the statistics of net value of agricultural output in 
nearly all underdeveloped countries is that of significant understatement. This 
may partially account for the extremely low per capita output of the agricul- 
tural sector relative to that of the nonagricultural sector (see Appendix). This 
appears to be true even for countries where the statistician undertakes to make 
some rough, arbitrary adjustment for the deficiencies. When venturing into 
unknown areas cautious statisticians are understandably conservative (for 
example, in the case of the Philippines, discussed in Item 2 of the Appendix). 

Mining plays a relatively insignificant role in Asian countries, though this 
is not the case in several Latin American and Middle Eastern countries where 
oil is important. Generally, these are produced by large, organized units and 
data are usually plentiful. However, mining statistics do not cover the output 
of individuals operating on their own in quarrying, clay, and stone collection. 
This type of activity is more important than in advanced countries. 


Output Estimated from Employment Data 
The general procedure is to use labor-force figures based on the nearest 
population census and earning statistics. An addition is made for profits, 


* Seasonal fluctuations are large in underdeveloped countries, owing probably to small inventories, inade- 
quacies of storage facilities, and the small degree of inter-regional commodity mobility. See data, for example, in 
India, Min, Food and Agric., Dir. Econ. and Stat., Indian Agricultural Price Statistics. 

7 Some agricultural censuses attempt to collect a number of cost items such as fertilizer purchases, irrigation 
costs, seeds, etc. (e.g., the 1948 Philippine census). But most of them do not. The trouble to which estimators are 
forced to go in constructing cost estimates is fully described in the case of Brazil, where there is an “almost total 
lack of information on cost of production” (Item 7 in Appendix). 
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usually based on tax sources. This method is followed for all other sectors and 
accounts for roughly one-half of the total output in most underdeveloped 
countries. (A few countries, especially those without labor-force data, attempt 
to estimate some of these sectors by the physical output method. Thailand is: 
an example. Only a severely limited coverage seems to be attained.) 

One problem in the use of labor-force data stems from conceptual difficulties 
in their application to the structure of underdeveloped countries. This can be 
seen by examining the ratio of labor force to total population as given in the 
Yearbook of Labour Statistics, 1955, of the International Labour Organization, 
Table 1, pp. 6-7. First, the variation of these ratios between countries within 
Latin America, within Asia, and within Africa is much larger than within 
Europe, suggesting that enumerators in the former areas were unable to in- 
terpret the definition of labor force uniformly.* Second, the ratios are signifi- 
cantly lower for the underdeveloped countries than for the developed (Roughly 
37 per cent for Asia, Africa, and Latin America, and 45 per cent for European 
countries). Even allowing for the higher proportion of children (due to high 
mortality rates) in the underdeveloped areas, the proportions are too low for 
subsistence economies. 

Where most of the population can be demarcated into economically active 
and inactive, as in the developed countries, the concept of labor force has clear 
meaning. In the underdeveloped economies the demarcation is not sharp. A 
much larger proportion of housewives and students do a significant amount of 
part-time work. It is difficult for even the best trained enumerator to decide 
how to classify these individuals.* The part-time workers excluded in the non- 
agricultural labor force understate the labor force for national income purposes. 

The quality of annual income data varies from country to country. The 
sources of data are social security reports (Brazil), industrial censuses (Peru, 
Colombia), income censuses (Philippines 1948, Cuba 1943, Ceylon 1946), tax 
statistics, sample surveys on wages and salaries and miscellaneous, scattered 
collection of data (India), usually from the departments of labor or social wel- 
fare (see Methods of National Income Estimation, pp. 9-12). With few excep- 
tions (e.g., Cuba and Brazil) social security coverage is severely limited. In- 
dustrial and establishment censuses usually cover a small part of industrial 
output and when they are made to cover small industries and handicrafts, the 
data collected are not usable (see United Nations Statistical Office, 7ndustrial 
Censuses and Related Inquiries, Series F, No. 4, 1953). Income censuses, for 
example in the Philippines, have wide coverage but the data that can be col- 
lected are for a single month. The problem of converting a given month’s in- 


* See India, Min. Finance, Dept. Econ. Affairs, First Report of the National Income Committee of India, April 
1961, p. 55, where a table is presented showing considerable variation (from 29 to 55 per cent) in the labor force as 
between provinces. 

* See, for example, the vague instructions given to the enumerators in the Indian Census of 1951, the 1946 
Census of Ceylon, and the 1947 Census of Malaya. See also United Nations, Application of International Standards 
to Census Data on the Economically Active Population (Population Studies No. 9, New York, 1951), p. 11. This 
problem, like so many statistical problems in underdeveloped countries, inheres in the very structure of a pre- 
industrial economy, and no easy solution is ia sight. From the viewpoint of national income estimation, a desirable 
solution may be to include all part-time workers, irrespective of their usual status as students or housewives. On a 
small sample basis, these workers may be asked information on the number of weeks or months worked during the 
year. J 


— 
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come to an annual basis is difficult owing to wide seasonal variations in eco- 
nomic activities. The following dilemmas may be noted: if the data are good 
the coverage is limited, as in social security records and establishment censuses; 
if the coverage is good the data are difficult to use. 

Most countries are forced to rely on a combination of the above sources with 
a heavy emphasis on scattered, miscellaneous wage statistics collected by labor 
and welfare departments (most of which are daily, weekly, or monthly rates).'° 
It is difficult to say whether in the net these data understate or overstate 
average incomes originating, since there are too many important factors in- 
volved. The data collected in the above sources (except income censuses) are 
usually from the larger concerns and may be expected to be higher than those 
in small units in towns and villages. Whether this produces an upward bias 
depends on the adjustments (for part-time employment, seasonality, industry, 
sex, age, etc.) made before applying the data to the total labor force. The 
adjustments made appear to vary for countries, although detailed information 
is frequently not published. 

An important omission is the nonfarm output of farm families. This type of 
production is especially relevant in Asian countries where minute holdings re- 
quire the peasant family to produce a number of products during the off-season. 
These items are clothing, footwear, and other personal effects, furniture, mats, 
kitchenware, and other household goods, firewood collection, own-account 
construction, etc."' Precisely because these are home-produced and home-con- 
sumed they elude the efforts of the statistician to measure them. Consequently 
very few countries bother to include them, although in a few instances (e.g., 
India) coverage of a large portion is attempted. 

The discussion shows that with few exceptions, if any, existing estimates of 
gross product in the underdeveloped countries tend to be significantly under- 
stated, particularly with respect to physical output in agriculture and to the 
use of labor-force data to measure work done in nonagriculture. It is, of course, 
difficult to guess the margin of understatement, since in a complex aggregate 
such as national product there is a great deal of offsetting of opposite errors 
in the items. If one is permitted to speculate, it will not be surprising if the 
understatement in the estimates of some countries discussed in the Appendix 
runs in terms of 10, 20, or 30 per cent.” 

1° Monthly wege sampling appears to be successfully employed in Japan. Here again, the coverage is limited 
to firms habitually employing more than 30 persons, which would exclude most of the nonagricultural labor force 
in underdeveloped countries. The fundamental obstacle to the collection of data on annual wages of workers and 
annual earnings of working employers in small firms is that the underdeveloped economies have not evolved to a 
level where annua! incomes exist empirically in a clear-cut fashion. The work of these labor force members is ir- 
regular, seasonal, haphazard in duration; payments are made in kind, on a commission or cooperative basis or else 
on a complex system of cash payments; the workers are self-employed, use unpaid family help, apprentice, women 
and children, etc. These conditions are formidable for anyone trying to ascertain the annual incomes of such workers, 
who comprise the majority of the nonagricultural labor force. Since the concept of enterprise or establishment with 
its working force (for a period of one year) is elusive, the only way out of this dilemma is to obtain income data 
from the households as the unit of the survey. But more work along this line is needed before such sampling can 
turn up reliable data. (For a discussion on property incomes, see next section.) 

1 A detailed list of nonfarm activities in a relatively underdeveloped area is given in J. F. Embree, Suye Mura, 
A Japanese Village (University of Chicago Press, 1939), p. 51. The colonial period in American history supplies 
appropriate examples of home-produced goods. 

18 The procedure is followed in some countries of assessing the accuracy of the total by attaching subjective 
error margins and these generally run in terms of plus and minus 10 to 15 per cent. The discussion above would 
point to margins with larger plus then minus. Moreover, 10 or even 15 per cent seems low, especially if one compares 


these with Kuznets’ 10 per cent in National Income and Composition 1912-1988 (National Bureau of Economis 
Research, New York, 1941), p. 528. 
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2. DISTRIBUTIVE-SHARES STATISTICS 


Income originating in each industry is defined to be divisible into distributive 
shares originating in each industry. If computed from product data, income is 
obtained as a residual by deducting certain expenses, and there is no need to 
separate the various income shares. Most Asian countries do not attempt to 
separate each share comprising net income (e.g., India, Ceylon, Thailand, 
Pakistan, Indonesia). A number of Latin American countries (Cuba, Mexico, 
Peru, Chile, Colombia, Venezuela) obtain income originating in each industry 
as a sum of shares. 

The most important distributive shares, employee compensation and income 
of proprietors, accounting for two-thirds to three-fourths of gross domestic 
product, have in effect been discussed in the foregoing section. The conclusion 
that these are generally understated applies to both shares, and the description 
of sources and methods in Section 2 is mainly relevant to these shares. The 
rest of the shares are rent, interest, profit, depreciation, and indirect taxes. 

Rent is almost exclusively rental on dwellings in Asian statistics. For coun- 
tries with complete distributive-shares breakdown, rental on dwellings forms 
most of the total rent shown. Nearly all agricultural rent and a major part of 
industrial rent are implicitly included in profits and proprietors’ income. (In 
the Egyptian estimate, agricultural rent is separated out.) The general pro- 
cedure is, first, to obtain an estimate of total dwellings and apply a suitable 
rental rate. The result, after gross elements are deducted, is not only actual 
rent paid for dwellings but also imputed rent of owner-occupied premises. (In 
detail, the method can be very involved, owing to the need to avoid duplica- 
tion.) 

Total dwellings are based on housing surveys and censuses, as in Mexico and 
Malaya. Some countries do not have such censuses; in such cases, the number 
of families taken from population censuses is used as a basis to estimate dwell- 
ings. Some of the housing censuses do collect rents on tenant-occupied houses 
and for the large cities. These, with various adjustments, and other supple- 
mentary material such as income-tax data and family-budget surveys give 
estimates of rent in the large urban areas. The real problem comes with the 
estimation of rental rates in the towns and villages where the vast majority 
live. Here the houses are generally owned by the occupants and an appropriate 
measurement of services rendered by housing is difficult to find. Understand- 
ably, some countries do not attempt to make the estimate (e.g., Ceylon). Some 
countries that do, simply take a certain fraction or multiple of total urban 
rentals, depending on the relative size of the urban and rural population 
(Malaya). 

Other countries (India, Indonesia, Pakistan) calculate the value of rural 
dwellings by multiplying an estimated number of dwellings by an average 
value, obtained from either tax or survey sources. To the value of rural dwell- 
ing, an average rate of interest ruling in the countryside is applied and expenses 
are deducted. The rationale of using interest rates may be questioned. Some 
relation may be postulated between rent and interest rate if houses are custom- 
arily built with borrowed funds, but this is hardiy the case in underdeveloped 
countries. A further complication is that rural dwellings in these countries are 
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not only places of residence but also to a large degree workplaces. Since this 
part of rent is implicitly included in the income of farmers, the procedure leads 
to duplication. Statistically, for nearly all countries, the figures in each step of 
the calculation (value of housing, average interest rates, and expenses) are 
extremely rough approximations. The estimates of rural rent are therefore of 
limited value. 

Corporate profits aad interest are usually taken from tax statistics, especially 
from income taxes. Evasion and under-reporting are regarded by the estima- 
tors to be considerable, but due to the absence of a reasonable basis for adjust- 
ment, none is usually made (e.g., Brazil). In Egypt, profits from tax data are 
raised 5 per cent. Profit and interest statistics taken from questionnaire surveys 
are not likely to be very different from those reported in the tax statistics." 
In Cuba, profits of sugar plantations are computed by taking an average profit 
(based on trade sources) per bag of sugar. The understatement of profits and 
interest is probably fairly large in the estimates of most countries, especially 
in Latin America, though Mexico is an exception. 

Depreciation is difficult to estimate in all countries, but particularly so for 
underdeveloped countries. Tax sources, inadequate as they may be for this 
purpose in developed countries, are not even available to the estimator for a 
vast segment of the capital assets of underdeveloped countries. There is hardly 
any alternative but to take an arbitrarily fixed percentage (e.g., in Malaya, 
Indonesia, Thailand, etc.) ranging from 3 to 8 per cent of gross output. In 
Cuba, use is made of the Puerto Rican percentage. In some countries (e.g., 
India), where data on quantities of a few simple items of agricultural equipment 
(plows and carts) are available, an attempt is made to multiply these quanti- 
ties by some suitable unit value. Capital consumption is arrived at by assuming 
average durability of the equipment. The estimates are very approximate be- 
cause of the difficulties of obtaining suitable average values and average dura- 
bilities. Most of these items are handmade and their types and qualities vary 
considerably from village to village, depending on needs and available ma- 
terials of construction. 

Here, again, the statistical difficulties with regard to depreciation stem partly 
from conceptual reasons. In the small units of production in underdeveloped 
countries, depreciation policies are not followed in actual practice. The small 
equipment and structures of these units are repaired and when no longer re- 
pairable are duly replaced. A replacement policy suffices for these simple items. 
It is hard to measure activities which do not exist in fact.“ 

Ind:rect taxes are obtainable directly from government accounts and so do 
not raise any major statistical problem. The problem faced by statisticians is 
one of definitions. Are land taxes, for example, direct or indirect? In Asian 
countries, taxes on agricultural land behave more like direct than indirect taxes, 
although this may not be the case in developed countries. In the First Report 
of the Indian National Income Committee, land taxes were treated as indirect 


4% See Amor Gosfield, “Input-Output Analysis of Puerto Rican Economy,” paper presented before the Con- 
ference on Research in Income and Wealth, October, 1952, in Input-Output Analysis: An Appraisal. Princeton: 
National Bureau of Economic Research, Studies in Income and Wealth, 18 (1955), pp. 321 ft. 

% Depreciation accounting, like machinery and the factory, is a product of the Industrial Revolution. The line 
betweer gross and net income or product is necessarily blurred for pre-industrial economies. 
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taxes, but in the Final Report they were handled as direct taxes. In Latin 
America, land taxes are regarded as indirect taxes. 

To summarize, gross output viewed as distributive shares also seems under- 
stated for most of the underdeveloped countries. 


3. EXPENDITURE STATISTICS 


Burma, Ceylon, Malaya, and the Philippines, in Asia, and Colombia, 
Ecuador, Dominican Republic, Guatemala, and Venezuela, in Latin America, 
have worked out the expenditure side of the account. Expenditures are dis- 
cussed under the following headings: private consumption expenditures, gen- 
eral government current expenditures, gross domestic investment, and excess 
of exports of goods and services over their imports. (The various methods used 
are described in the UN’s Methods of National Income Estimation.) Since con- 
sumption expenditures are obtained as a residual, they are discussed last. 
Besides the foregoing countries, there are a number of others which estimate 
investment. 

Government expenditures are usually the best estimate in the expenditure 
total, being derived from budget statistics. It is true that the accuracy of the 
latter in many countries leaves much to be desired—e.g., Cuba before 1952. 
Nor does the coverage of local government accounts, especially on « village 
level, seem adequate in a number of countries. For a large country 1. .e India 
the collection of local government accounts is an enormous task for the national 
income staff. In Latin America the main difficulty is with municipalities. When 
details of local government expenditures are not available, the national income 
staff is faced with the problem of separating out such expenditures as purchase 
of land, second-hand equipment, and transfer payments, which do not belong 
to gross national product. These difficulties with local government data are due 
in part to the fact that these governments are only now beginning to bring 
together all accounts into one large consolidated total. In part it is conceptual. 
As one reaches down to the grass roots level of political authority, the boundary 
between what is strictly government and what is voluntary co-operation begins 
to blur. In Asian countries, the voluntary services rendered by villagers for 
administrative, clerical, statistical, ceremonial, agricultural, security, safety, 
and other purposes are by no means negligible. Such services typically are not 
included in the receipts and expenditures of the local governments. 

Gross domestic investment comprises private and public investment. The latter 
is culled from the budget. Aside from the difficulties of separating capital 
expenditures from current expenditures and from expenditures on land for 
construction site and from other existing assets, the total obtained is fairly 
good. 

Inventory changes in private investment are nearly impossible to measure 
comprehensively, owing to the lack of data. Nor is it easy for the estimator to 
define inventory changes in economies where the unit of production is pre- 
dominantly the family. For example, the peasant stocks up grain not only for 
sale but for household consumption. Since inventory in the national accounts is 
business inventory, i.e., stocks held for eventual sale or use in the productive 
process, the separation of peasant-held stocks into home use and business use 
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is most difficult even for the owners of the stock.“ Accordingly, with few 
exceptions, countries do not attempt to measure changes in inventory compre- 
hensively. In the Philippines, only changes in livestock and sugar stocks are 
included; in Cuba, only sugar and tobacco stocks; and in Malaya, only rubber 
and tin stocks. In Ceylon and Indonesia, inventory changes are not estimated 
at all. Burma is an exception. Inventory changes are estimated by taking 
changes in the volume of commercial loans. Since the latter are not issued 
solely for inventory purchases, nor all inventories purchased out of loans, the 
method is not too reliable. 

Fixed-capital formation is chiefly derived from import statistics. One prob- 
lem in going through the import data is the separation of capital goods from 
consumer durables (automobiles, bicycles, sewi:z machines, refrigerators). 
Frequently, a firm basis for making the separaticn is not available, but for 
automobiles, registration statistics can be used. Import values are converted 
to prices paid by final buyers by adding a mark-up, usually from 30 to 50 per 
cent. These are often estimated after consultation with importers and other 
appropriate dealers; in other cases a percentage is assumed and added to the 
aggregate. 

A more important problem is the estimation of domestic production of 
capital goods. The volume of such production is more important in the larger 
countries than in the smaller ones with extensive foreign trade (Ceylon, Philip- 
pines, Malaya, Cuba, etc.). When available, factory production statistics are 
used (e.g., Philippines and Colombia). 

Since wood and other principal construction materials are domestically ob- 
tainable, for most countries the import and factory production data (e.g., of 
cement) fail to cover a large portion of total materials for construction. Labor 
is even less adequately accounted for. In Latin American countries, building- 
permit statistics are used by the estimators. But in most cases these permits 
are collected only for the main cities. The coverage excludes smaller structures 
and outlays for major alterations. Various adjustments are attempted, but at 
best—in Brazil, for example—they are no more than informed guesses. 

The crucial problem concerns minor tools, equipment, and minor structures 
produced in the villages, oftentimes right on the farm. These comprise in the 
aggregate a large part of gross investment and are not covered by import, pro- 
duction, construction, or government budget statistics. Specifically, these are 
carts, wooden plows, hoes, sickles, and similar hand tools, wells, paths, 
ditches, dikes, fences, barns, sheds, and other minor structures (and corre- 
sponding tools and structures for the handicraftsmen and traders), all of which 
last more than a year. One or two countries—India for instance—try to esti- 
mate these items, but the basic data underlying the estimates are poor. In 
general, most of these items are excluded from the estimates of most under- 
developed countries. Their totals may, therefore, be interpreted to comprise 
fixed capital formation of the fairly modern type, comparable to the western 
definition of capital goods. And for most uses such a total, though incomplete, 
may be deemed sufficient. Small equipment and minor structures in the rural 


% A solution would be to depart from the national accounting definition and regard all year-end increases of 
stock as inventory increase. 
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areas may be considered either as current expenditures (and therefore inter- 
mediate goods) or as assets whose consumption in normal years for the economy 
as a whole is roughly equivalent to replacements. For a nondynamic economy 
the assumption is thought to be sufficiently valid. On this assumption, their 
effect on development is taken to be negligible, although, to the extent of the 
omission of such replacement expenditures, gross output is understated. In 
sum, the estimates of fixed capital formation (defined in a restricted sense) 
in underdeveloped countries, based as they are mainly on import, production, 
construction, and budget statistics, are acceptable. 

The excess of exports over imports of goods and services is taken from bal- 
ance-of-payments statistics which are available in nearly all countries. Though 
some of the figures, especially for service items, are crude, the balance of pay- 
ments generally is based on customs reports on import and export of com- 
modities. These are sufficiently accurate for the purpose of national income 
computation. They do not give rise to major problems although the import 
totals for a number of countries, Cuba, for one, may be significantly under- 
stated owing to attempts to avoid payment of import duties. 

Private consumption expenditures are usually obtained as residuals after de- 
ducting all the foregoing components from gross domestic output. The major 
exceptions are Malaya, Ceylon, and the Dominican Republic. In these coun- 
tries, import statistics and production statistics are used to build up estimates 
of consumption."* 

As residual estimates, there is a tendency for the errors in gross output to 
cumulate in the consumption component. But this is only to the extent that 
the net errors in government expenditures, capital formation, and excess ex- 
ports are zero. The foregoing discussion may be taken to imply that these net 
errors are close to zero, with one exception. This is the omission of most of the 
inventory additions and rural equipment and structures from capital formation. 
To the extent that these are included in the gross output totals, these omissions 
are left in consumption expenditures. However, for most countries these 
amounts may be assumed to be small, especially since most rural equipment 
and structures are not included in gross output, as noted in section 2. Their 
effect on the consumption aggregate, which is typically from 75 to 85 per cent 
of gross output, is negligible. Thus, the previous conclusion, that generally 
speaking total output is significantly understated, applies with special force to 
the consumption component. 


4. CONCLUDING REMARKS 


Despite the aforementioned difficulties, the need for national income and 
expenditure statistics in underdeveloped countries will not diminish, but grow, 
because a measurement of the economy as a whole in value units and the 
quantification of each and all of the structural parts in a common unit are in- 
dispensable. But, as presently constructed, their usefulness is limited. 


16 A direct estimate of private consumption for small countries appears feasible. But more work needs to be 
done to yield better estimates, especially in bringing together family consumption and budget surveys, import and 
retail price statistics, agricultural and production statistics, and population censuses. Work along this direction for 
Cuba is presently under way, 
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This brief survey has concentrated on what are thought to be some of the 
major statistical and conceptual problems, leaving aside a host of minor ones. 
The survey points to the need to examine the adequacy of basic sources of data. 
This is especially true with respect to agriculturai data since the structure of 
production in pre-industrial countries is dominated by agriculture. The paper 
advances the view that the major source of a downward bias resides in the 
agricultural production statistics. 

There is also a need for the statistics of national income and expenditure to 
be presented with more description of the methods of computation and the 
quality of the underlying data. Since subjective error margins are of limited 
value, such information is about the only means by which the users of the 
estimates can be properly warned of the adequacy of the figures for the par- 
ticular uses intended.” 


APPENDIX 


CURRENT ESTIMATES OF MAJOR UNDERDEVELOPED COUNTRIES 


The following references and notes are intended to serve as examples of the generaliza- 
tions and conclusions in the text. It will be seen that a number of tests are applied to the 
estimates to check their plausibility. The writer has found, while working at the United 
Nations National Accounts Branch and subsequently, that the share of government (de- 
fined to include taxes and non tax receipts, on the income side of the consolidated govern- 
ment account), the share of food expenditures (defined to exclude tobacco and alcoholic 
beverages), and the share of personal consumption, all taken as proportions of GNP, are 
fairly stable. The lower the level of productivity of the economy, the higher the proportion 
of food expenditures and personal consumption, and the higher the level of productivity 
the higher the share of government.! Of course, these tesis are rough and not infallible,’ 
but in the absence of convincing explanations to the contrary, they are suggestive of 
statistical inadequacies. From experience, the writer has found these checks to be useful. 

Existing data suggest that for the developed countries food expenditures are around 
one-fourth of GNP; this compares with one-half or more for the undeveloped countries. 
The share of government in GNP is one-fourth or more for the developed countries and 
one-eighth or less for the underdeveloped countries, Between these proportions are coun- 
tries like Puerto Rico which may be classified as semi-developed. 


ASIA 


1. Thailand: Preliminary Estimates of the Gross Geographical Product and Domestic 
National Income of Thailand, 1988-1939, 1946-1950 (1952), by Dr. Joseph 8. Gould, 
Economic Adviser to the National Economic Council. 

Indonesia: Ekonomic Dan Keuangan Indonesia [Economics and Finance in Indonesia], 
June, 1954. Estimates for 1951 and 1952 were prepared under the direction of 8S. Daniel 
Neumark at the request of the National Planning Board of the Indonesian Government. 

Burma: The National Income of Burma, Ministry of National Planning and Religious 
Affairs, Rangoon, 1954. 

The chief difficulty faced by the estimators in all three countvies was the lack of recent 
population censuses. Without labor-force data from population censues, there is no way 
of knowing the approximate magnitude of the nonagricultural sector. Small units of 


17 This article is a summary of a series of papers written over a number of years. Acknowledgment is made to 
J. B. D. Derksen, former chief of the Naticnal Accounts Branch of the United Nations Statistical Cfice, Car! S. 
Shoup of Columbia University, and Simon Kuznets of John: Hopkins, who have read and commented on several of 
these papers. 

1 See the author's forthcoming paper in the American Economic Review (June 1957 issue); and also M. K. Ben- 
nett, The World's Food, (New York, 1954), p. 25, on food expenditures. 

2 For example, the large revenue from exported oil received by the Venezuelan government makes possible a 
share of government (22 per cent of the postwar years) considerably in excess of other Latin American countries. 
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industry, trade, transport, and services dominate the nonagricultural sector of Asian 
countries and only a fraction of the output of these units is covered by existing production 
statistics. 

2. Philippines: Annual Report of the Central Bank of the Philippines for recent years. The 
basic study is National Income of the Philippines and Its Distribution, UN Technical 
Assistance Program, New York, 1952. The second publication contains the estimate for 
the benchmark year, 1948. In 1948, four censuses were held to overcome the gaps in the 
national statistics: census of population, census of income, census of agriculture, and an 
economic census covering the larger units in nonagriculture. Assuming the adequacy of 
the censuses, the 1948 national income total may be accepted as approximately accurate. 
The major difficulty faced by the staff of the Central Bank is the estimation for years 
after 1948. Since no censuses have been taken since 1948, and sample surveys have not 
been established, the estimates for the 1950’s are heavily dependent on 1948 censuses. 
3. Ceylon: National Income of Ceylon by K. Williams, Director of Census and Statistics, 
Ministry of Finance, Colombo, July 1952. 

Malaya: National Income of Malaya 1947-1949, by Frederic Benham, Singapore, 195i. 

Unlike estimates in parts 1 and 2, expenditures on gross national product in these areas 
are directly estimated. Consumption expenditures were obtained by multiplying output 
by retail prices. The available price statistics mainly covered the large cities, so that to 
the extent that price levels are higher than town and village price levels, there is over- 
estimation. On the other hand, agricultural output is probably understated. The down- 
ward bias in the latter may exceed the upward bias in the former, at least for Ceylon 
where the estimated food expenditures come up to only 40 per cent of GNP in the above 
publication. This compares with 79 per cent of family expenditures for 1,000 rural families 
in the 1950-1951 income and expenditure sample survey. (See Yearbook of Labour Sta- 
tistics, 1955, 1.L.0., p. 328.) 

4. India: Final Report of the Natienal Income Commitice, February 1964, Ministry of 
Finance, New Delhi. Various sample surveys indicate that Indian agricultural production 
statistics, like those of other underdeveloped countries, may be seriously understated. In 
other respects, the obstacles faced by the national income staff are essentially the same as 
in the foregoing countries. The Report, together with the First Report, gives a clear picture 
of the difficulties typically confronting national income estimators in Asian countries and 
the various, ingenious solutions attempted. An additional problem for India is the size 
of the country. India may be looked upon as a family of nations, each with its own 
economy. The meaningfulness of aggregating incomes generated in units of rupees for all 
the provinces needs to be examined. 

5. Egypt: National Income of Egypt for 1953, Statistical and Census Department, Min- 
istry of Finance and Economy, December 1954, Cairo. Income originating in agriculture is 
one-third of national income and the share of government (all taxes and non-tax receipts 
of all levels of government) in GNP is one-fifth. For an underdeveloped country, the 
former proportion appears too low (especially since two-thirds of the labor force was 
reported to be in agriculture in 1947) and the latter too high. These together with the 
high share of profits suggest incomplete coverage in the agricultural statistics. 

Turkey: National Income of Turkey, 1938, 1948-1954, Central Statistical Office, An- 
kara, 1955. Government receipts as a proportion of GNP average one-fifth for the years 
covered. This seems on the high side. 

7. Brazil: Revista Brasileira de Economia, December 1953, Instituto Basileiro de Econ- 
omia. The most puzzling aspect of the estimates is the wide variation in per capita income 
of the states comprising Brazil. When the national income total for each state for 1950 is 
divided by the respective state population (from the 1950 census), per capita incomes 
ranging from a low of 1,200 Cr. in Maranhao and Piaui to highs of 13,500 and 7,800 Cr. in 
Distrito Federal and Sao Paulo, respectively, are obtained. This suggests the possibility 
of deficiencies in the underlying basic data, especially agricultural, and also that the 
econemy of Brazil is not an integrated whole. In the latter case, a number of states (about 
half of the states have per capita incomes below 3,000 Cr.) may have price systems only 
tenuously related to the more ueveloped regions of Brazil. The former possibility is sug- 
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gested by the fact that income originating per person in the labor force in the primary 
sector is only one-third that of the rest of the sectors. Since income originating in the non- 
agricultural sectors does not appear to be overstated, it may be that agricultural produc- 
tion statistics made available to the national income staff are deficient in coverage. The 
share of government in GNP is also very high—18.9 per cent for the postwar years. 

8. Peru: La Renita Nacional del Peru, 1942-1952, Banco Central de Reserva del Peru, 
1954. In order to overcome the scantiness of basic data, the Peruvian government sends 
out detailed questionnaires to all registered enterprises for the computation of national 
income. The tabulated results for these units are blown up to cover non-registered units. 
A cursory examination of these figures from the questionnaire reveals very wide fluctua- 
tions in the annual earnings of employees and of employers from various industries. In 
1951, the average income of proprietors in nonagriculture works out to be only one-half 
that of wage-earners in nonagriculture. Even after allowing for low-income own-account 
workers included as proprietors, this situation seems unusual. 

9. Mexico: Economic Development of Mexico, Report of the Combined Working Party, 
1953, published for the International Bank for Reconstruction and Development. These 
figures show a number of surprising results. The share of profits is 41 per cent of national 
income in 1950 and the share of property incomes is nearly one-half. These are excep- 
tionally high proportions. (Statistics of National Income and Expenditure, United Nations, 
Series H, No. 1, pp. 31-37, shows percentage distribution of distributive shares for coun- 
tries with tolerably adequate national income data. For no country is the share of rent, 
profits, and interest greater than one-fourth.) The high share is mainly due to the enor- 
mous size of profits in commerce and services. Income per person in the labor force of the 
latter is more than five times that of agriculture. 

10. Nigeria: National Income of Nigeria, 1950/61, Colonial Research Studies, No. 11, 
1953, p. 20, by A. Prest and I. G. Stewart. The meaningfulness of aggregating incomes and 
commodity values of such a loosely integrated economy as Nigeria is brought into question 
when, according to the authors, there is very wide variation of prices from one month to 
another, one village to another (the price of a commodity in one village may be two or 
three times that in another village on the same day.) If gross product for each of three 
regions (West, North, East) is divided by population (from the 1952-53 Census), per cap- 
ita income in the North is twice, and in the East two-thirds, that in the West. Also, income 
originating in agriculture and fishing per person in the labor force is nearly as large as that 
in nonagriculture (excluding the Northern Region). This is quite unusual for underdevel- 
oped countries, where the former is usually around one-half of the latter. 

11. Southern Rhodesia: Economic and Statistical Bulletin of Southern Rhodesia, Central 
African Statistical Office, February 7, 1954. The production of the native tribes in the 
villages is included as a “notional” estimate in the national income totals. It is not very 
clear what notional means in this context. But if the 1950 total of African incomes (wages, 
proprietors’ income, and subsistence output) and the total of European incomes are di- 
vided by the estimated African and European population, per capita income for the 
former is 1.3 pound as compared to 428.0 pounds for the latter. “Notional” may therefore 
be taken to mean that incomes generated in the African villages do not belong to the same 
price system as those generated in the European sector. This point is also relevant to the 
notional estimate in combined Northern Rhodesia, Southern Rhodesia, and Nyasaland. 
See Monthly Digest of Statistics, Central African Statistical Office, July 1956. Here, there 
is the added problem: what meaning attaches to the aggregation of European incomes of 
the three different colonies? 

12. Belgian Congo: Bulletin de la Banque Centrale des Congo Belge et du Raundi-Urundi, 
October 1956. Income originating in agriculture is 27 per cent, wages and salaries are 
nearly one-half of national income, the share of government is 20 per cent of GNP, and 
private consumption 60 per cent of GNP. These proportions are generally characteristic 
of fairly advanced economies. It seems inconceivable in an economy where more than 80 
per cent of the labor force are reported to be in agriculture. Is the difficulty similar to 
that discussed for Southern Rhodesia? 
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ESTIMATING THE SERVICE LIFE OF HOUSEHOLD 
GOODS BY ACTUARIAL METHODS 


Jean L. Pennock anp Caro. M. JAEGER 
Institute of Home Economics, U. 8. Department of Agriculture 


Actuarial methods are applied to data obtained from two household 
surveys to derive a measure of the service-life expectancy of household 
goods under one owner. This measure is of value to economists con- 
cerned with family budgets, and to manufacturers and distributors 
interested in knowing how often the consumer will enter the market. 
Although the samples obtained in the two surveys were too small to 
yield reliable estimates of service life, the data are used to illustrate 
the problems encountered and techniques used. Recommendations are 
made concerning schedule design and field procedures. 


THE PROBLEM 


HE life expectancy of industrial equipment has been treated in a consider- 
Tits body of literature.' Similar factual information on the life expectancy 
of consumer durable goods is needed, since, for the most part, such estimates as 
exist have been based on opinion rather than fact. Frequently these estimates 
are generalizations on the personal experience of manufacturers, salesmen, 
servicemen, fire insurance adjusters, and other technicians who are familiar 
with the household goods and their performance. Considerable divergence can 
be found among estimates arrived at in this way. Studies of the use potential 
of various types of equipment have been made in laboratories but there is nc 
satisfactory technique for translating findings from such studies into estimates 
of length of service under conditions of use in households. Moreover, these 
goods are often discarded or replaced, not because they are worn out, but be- 
cause the family’s needs or preferences have changed or because the article is 
outmoded. 

To explore the possibility of deriving estimates of service life of household 
goods by the actuarial method, the Household Ecunomics Research Division of 
the Institute of Home Economics collected data on a limited number of house- 
hold articles in two surveys conducted largely for other research purposes. 
From these data, tables of service-life expectancy under one owner were con- 
structed. This paper deals with this experience and considers particularly the 
practicality of the method in relation to problems in the collection of the data 
and to the size of sample needed for reliable estimates. The focus of interest is 
methodological. Such estimates as are given are merely illustrative. No dis- 
tinction has been made between items acquired new and used. With larger 
samples separate estimates could be made for each of these categories. It should 
be noted, however, that the results will not be additive. Because the service 
life under discussion is limited to that under one owner, results will not be of 


1 Some indication of the scope of work done can be obtained from the following references: 

Winfrey, Robley, and Kurts, Edwin B., Life Characteristics of Physical Property, Bulletin 103, lowa Engineering 
Experiment Station, Iowa State College, 1931. 

Altman, Oscar L., and Goor, Charles G., “Actuarial analysis of the operating life of B-29 aircraft engines,” 
Journal of the American Statistical Association, 41 (1946), 190-203. 

Grant, Eugene L., and Norton, Paul T., Jr., Depreciation, New York: Ronald Press Co., 1949. 
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direct interest to economists working with aggregate consumption and savings. 
It should, however, prove useful to families and those who counsel families on 
budgetary problems, since for them the question is how long the individual 
owner can expect to use the item, assuming the prevailing practices in regard 
to its discard. The manufacturer and distributor will also be interested in de- 
termining how frequently the buyer of the item returns to the market. 


DATA USED 


As a beginning, questions on inventory and discards for a selected list of 
household appliances were appended to a general study of income and ex- 
penditures made in 1949 among 461 farm families of selected family types in 
four counties in western Kansas. The pertinent section of the schedule took the 
following form: 


Household facilities and equipment 


(Check (If obtained in the schedule year—) 
Item one) (If yes) 

Year Price Item discarded, Age of 
No Yes acquired per if any item 
article (Specify) discarded 
(a) (b) (c) (d) (e) (f) (g) 


The universe in the second study, made in the latter part of 1954, differed 
sharply from the earlier; it consisted of 210 households in that part of the 
urbanized area of Wilmington within Delaware.’ Since this was primarily a 
study of living room inventories, the items investigated were major items of 
living room furnishings. 

Differences in the nature of the items to be studied and the results of work on 
the Kansas data suggested a different schedule form. Data on the inventory 
and removals were collected separately. The list of items in the inventory and 
their date of acquisition were obtained in a series of sections covering all the 
major items that might be found in living rooms. In these sections data on the 
method of acquisition, cost, and work done to maintain the item, all matters 
not pertinent to this particular problem, were also collected. The removal data 
were obtained in a later section. The questions here were: 


Did you take out of your living room within the last 3 years any of the following items: 


For each “Yes,” ask: 
Item Yes No 


How When was How long had you had it in you 
many? that? living room when you removed 
it? 


2 The data were collected by National Analysts, Inc., Philadelphia, Pa., under contract with the U. 8. Departe 
ment of Agriculture. 
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CALCULATION OF SERVICE-LIFE EXPECTANCY RATES 


The actuarial method as applied to household items hinges on the develop- 
ment of independent removal rates for each age interval throughout the maxi- 
mum life span of the item under study. These age-specific removal rates are 
applied to a hypothetical cohort assumed to be starting life simultaneously, 
reducing it from its starting strength at the beginning of the service life to zero 
at the close of the maximum life span. On this basis the aggregate number of 
years of service by the cohort is obtained and from this the average number for 
its members. The latter is the service-life expectancy at the time the article is 
put into service. 

There are two methods of applying this general technique. One involves 
working from observations on a group of items put into service at the same 
time. This method is impractical from an operational standpoint. It involves 
the periodic collection of observations throughout the life span of the articles 
under study, a monumental amount of fieldwork complicated by difficulty in 
following respondents as the study progresses. Moreover, if the average service 
life of the article is changing either because of changes in the article itself or in 
the conditions of its use, the findings will be of limited usefulness because they 
will not become available while articles or conditions of use similar to those 
studied are in existence. 

The second method, which has been used here, is based on a cross section in 
time. Through the use of arrays of the inventory and recent discards by age, 
the age-specific removal rates necessary to the computation of service-life 
expectancy are obtained. This method also involves the questionable assump- 
tion that there is no change over time in removal rates. The resulting estimate, 
however, is more meaningful than that obtained by the first method since it is 
based on the experience of articles still in service at the date of estimate. It is 
also obtained with a minimum of fieldwork. 

The calculation of the service-life expectancy is illustrated in Table 1, using 
the data from the Wilmington study on curtains and draperies.* The steps in- 
volved in reaching the estimate and determining its standard error are as 
follows: 

Columns 2 and 3 present the data obtained in the survey (with some modifi- 
cations discussed below) ; these data are twofold—the inventory at the date of 
interview and removals during the year preceding interview, both arrayed by 
age. To obtain a count of all items exposed to the risk of removal during the 
year (col. 4), these two columns are added together. The age-specific removal 
rates (col. 5) are computed by dividing the number removed in a given year of 
age by the number exposed to the risk of removal during that year. The age- 
specific survival rate (col. 6) is unity minus the removal rate. 

Column 7 reflects the experience of a cohort of 1,000 exposed to the removal 
rates just calculated. We begin with age 0, the point in time at which the items 
are put into service. The number surviving to age 1 is computed by multiplying 


* This table based on individual curtains or draperies fails to take t of any relationship between the 
number of panels in a room and the frequency with which they are discarded. In this form the data are of maxi- 
mum usefulness to the manufacturer or distributor. The table cou!ld have been prepared making the household 
rather than the panel :he unit, in which case it would be of more service to the homemaker. 
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TABLE 1 


LIFE TABLE FOR CURTAINS AND DRAPERIES IN THE LIVING 
ROOM OF ONE OWNER 


(197 households in the Wilmington urbanized area within Delaware, 1954) 


Of 1,000 units put 
Units* into service— 
in 
inven- Number 
surviving 
to age z 
(Lx) 
(7) (8) 


1,000 
1,000 


— 
~" 


0 
1.0000 
0 


4 
0 
0 
2 
1 
3 
4 
6 
0 
4 
9 
0 
0 
0 
4 
0 
0 


1 All items assumed to have been acquired on January 1 of the year of acquisition. 

2? Number of curtain and drapery panels. 

* The mean service-life expectancy is 6,329/1,000 +.5 or 6.8 years. 

4 When there is no inventory and no removal, the removal rate is indeterminate. For this model a survival 
rate of 1 was assumed, which yields a maximum figure. If a survival rate of 0 had been assumed, the mean service- 
life expectancy would he ve been 6.1 instead of 6.8 years, a decrease of 10 per cent. If the data had been smoothed, 
the estimate would fall between these extremes, 


178 
Age in 
years! : 
= | | 
4 134 
12 4 134 
13 0 1.0000 134 
14 .8000 . 2000 134 
15 0 1.0000 27 
16 0 1.0000 27 
17 0 1.0000 27 
18 be ‘ 27 
19 0 1.0000 27 
20 ‘ ‘ 27 
21 ‘ 4 27 
22 4 4 27 H 
23 4 ‘ 27 
24 1.0000 27 
25 0 27 
26 0 0 | 
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the number entering upon age 0 by the survival rate for age 0, and similarly 
for successive ages. (Col. 7 times col. 6 for age z equals col. 7 for age +1.) 
Column 8 is the cumulative sum of col. 7, starting at the bottom of the column. 
To compute the service-life expectancy at any age the formula is 


S.+1 
+.5. 
L, 


To obtain the service-life expectancy when an item is put into service, compute 
at age 0. 
Since the samples in both studies were small, empty cells and differences in 


the maximum age reported for items in the inventory and items removed 
caused problems.‘ In the first instance, data were transposed from an adjoining 
cell; in the latter, the number of removals necessary to terminate the inventory 
distribution was arbitrarily assigned. These manipulations tended to give 
slightly higher estimates of service-life expectancy and considerably higher ap- 
proximations of standard error than might have been obtained by other devices. 

Irregularities in the data are to be expected in all but the very largest samples 
and smoothing of the data would generally be in order. No smoothing has been 
done in the work presented here, however, since interest is in methodology 
rather than specific findings and smoothing tends to reduce the standard error 
and so to cloud the effect of sample size. 


ANALYSIS OF RESULTS 


An isolated life expectancy estimate has little predictive value. It can be 
used without adjustment only when conditions are similar to the situation 
reflected in the life table. The estimate for mechanicai refrigerators shown in 
Table 2 illustrates this point. This estimate appears to be free from much of 
the bias due to reporting error discussed below, but on the basis of their own 
personal experience many readers will consider this estimate too low. It can, 
indeed, be accepted as lower than “normal,” even though it is impossible to 
define the normal situation, because it reflects the unusual combination of cir- 
cumstances existing when the data on which it is based were collected. Farm 
income was high in the Kansas wheat area when this study was made and hac! 
been high for several preceding years, with the result that funds were available 
for the purchase of household equipment. Electric power lines had only recently 
been put through a large part of this area, so that many families found them- 
selves in a position to use electrical equipment for the first time. The equipment 
itself had been scarce during and immediately after the war, but was readily 
available by the time of survey. This combination of circumstances led to 
heavy purchasing of equipment; over a third of the families in the survey 
bought a mechanical refrigerator during the schedule year. Undoubtedly many 
of the mechanical refrigerators discarded were gas or gasoline models that 
would have been continued in service had the families not wanted to change 


¢ When more is known about the relationship between age of head of the household and age of inventory or 
between age of dwelling unit and age of inventory, it might be possibie to circumvent this difficulty by stratified 
sampling. 
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over to electrical models. As a consequence the age-specific removal rates were 
high even in the initial portion of the age distribution, and service-life expect- 
ancy based on this experience is relatively low.' 

Although a single estimate of service-life expectancy has only limited useful- 
ness, repeated observations wil! indicate secular trends and the relationships 
between various economic, social, and psychological factors and service-life 
expectancy. To illustrate from another field, life tables for the U. 8. population 
based on the successive decennial censuses indicate the improvement in life 
expectancy in recent years and furnish a better basis for prediction of life ex- 
pectancy in the future than the most recent life table used alone. Repeated ob- 
servations on the life expectancy of household equipment might be expected to 
show similar trends attributable to mechanical improvements; it is possible, on 
the other hand, that the service life of some items is becoming shorter as style 


TABLE 2 


MEAN SERVICE-LIFE EXPECTANCY OF SELECTED ITEMS OF 
FURNISHINGS UNDER ONE OWNER 


Study Mean life 
Item furnishing expectancy 
data 


Living room curtains and draperies Wilmington? 
Sofas and couches in living rooms Wilmington’ 
Living room rugs and carpets Wilmington? 
Tables in living rooms Wilmington* 
Mechanical refrigerators Kansas* 
Gas cookstoves Kansas‘ 
Radios Kansas‘ 


1 For Wilmington data, life expectancy of article in the living room of current owner; for Kansas data, life 
expectancy in the dwelling of current owner. 

2 197 households in the Wilmington urbanized area within Delaware, 1954. 

+ 210 households in the Wilmington urbanised area within Delaware, 1954. 

¢ 461 rural households of farm operators of selected family types in Meade, Gray, Edwards, and Ford Counties 
in Kansas, 1948-1949. 


factors assume increasing importance. Since under usual conditions of house- 
hold use, furnishings and equipment are discarded before they reach a state 
of absolute unserviceability comparable to biological death, service-life expect- 
ancy rates reflecting high economic levels when replacement is easy will be 
lower than those for periods of depression. 

The foregoing are general limitations on the application of estimates of 
service-life expectancy. The estimates presented in Table 2 have additional 
drawbacks in being computed from very small samples and in combining data 
for new and used items. There is also reporting error in the data on which they 
are based. 

The nature and amount of reporting error and its effect on the estimate will 
be considered here in some detail because the construction of a life table from 


5 It will be noted that the estimate is for mechanical refrigerators. Had the data permitted a separate estimate 
for electrical refrigerators, service-life expectancy would probably have been longer. 


6.8 
13.5 
12.9 
32.9 
9.2 
15.9 
10.6 
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data obtained by recall techniques rather than from current records is a major 
departure and there has been little experience in obtaining discard or removal 
data from consumer respondents. In the two sets of data there appear to have 
been four general types of reporting error: 
. Reporting of removal data even though the removal took place outside 
the period for which such data were asked. 
. Failure to report removals that took place within the period for which 
such data were asked. 
. Failure to report age of items in the inventory and of items removed. 
. Inexact reporting of the age data for both inventory and removals. 

Reporting of removal data even though the removal took place outside the 
period for which such data were asked occurred with both schedule designs but 
more frequently with the Kansas design. This type of error creates little prob- 
lem if the schedule year coincides with the calendar year; the incorrect reports 
can then be identified and edited out. When, as in the Kansas survey, the 
schedule year and the calendar year do not coincide and it is known that re- 
movals outside the schedule year have been reported, it is impossible to deter- 
mine which of the reports for the calendar year that is partly in and partly out 
of the schedule year are incorrectly entered. This error, if uncorrected, over- 
states the age-specific removal rates anc emneneney understates the service- 
life expectancy. 

Partly because the linking of the questions on removals to the questions on 
acquisitions resulted in this type of error, the schedule design was altered on 
the second study; the two lists were obtained independently, the removal list 
in answer to the question “Did you take out of your living room within the 
last 3 years any of the following items?”* This second form led to a serious un- 
derreporting of removals which has the effect of reducing the age-specific 
removal rates and so overstating the service-life expectancy. 

The evidence that there was underreporting of removals in the Wilmington 
study is twofold—from examination of the individual schedules for internal 
consistency, and from a consideration of the tabulated data. 

There are three schedules from families long established in their homes that, 
if taken at their face value, indicate that before the purchase of sofas and chairs 
in 1954 these people had nothing in their living rooms on which they could sit. 
In these cases there can be no doubt that the respondents failed to report the 
removal of the pieces that the 1954 purchases replaced. These are the extreme; 
grading down from them are others in which the 1954 purchases may represent 
additions to meagerly furnished rooms but more probably are replacements of 
items whose removal is not reported. It seems apparent that with many families 
it was “out of sight, out of mind”; they could not remember the former appear- 


* A difference in the nature of the items covered by the Wilmington study also made the change necessary. In 
the Kansas study, removal data were called for only when the removal was made in conjunction with the acquisition 
of a similar item during the schedule year; removais for which there was no immediate replacement and removals 
to make room for an article of a different kind (two chairs in place of a sofa, or a television set in place of a radio, 
for example) would not be reported. These losses were not considered important as few of the items studied would 
be discarded independently (radio is the tion) or replaced by a wholly different type of thing. This is not 
true of the items in the Wilmington survey; the number of removals without immediate replacement in kind could 
be expected to be an appreciable part of the total number. 
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ine'el eabnenin and so could not report what had been taken out.’ 

Table 3 shows the magnitude of the year’s apparent gain in inventory among 
the Wilmington families. Some net gain is undoubtedly to be expected because 
of young families still in the process of furnishing their homes, the larger living 
rooms in the detached, single-family dwellings that form the bulk of new con- 
struction, and the slow accretion of possessions among established families. 
There will be little compensating loss because families moving into smaller 
quarters will tend to crowd in as many of their lares and penates 53 they can 
and the survey fails to tap the greatest single source of decrease in inventory, 
the families that go out of existence during the schedule year. Nevertheless, the 
increases shown in these data appear to be excessive. Such increases, if real, 
would soon result in substantial increases in the average stock of furnishings. 
They can be considered reasonable only when supported by changes in styles 

TABLE 3 


PURPORTED CHANGE IN INVENTORY IN THE SCHEDULE YEAR 
(210 households in the Wilmington urbanized area within Delaware, 1954) 


— Calculated Reported removals Reported acquisitions F 1 Net 
opening closing | gain in 
inventory | All |Replaced| Not An | Replace-| Addi- tory| year 
Number of articles 
Curtains and draperies! +? 718 128 108 20 205 108 97 795 77 
Sofas and couches 170 12 8 4 17 S .) 175 5 
Rugs and carpets 332 30 15 15 43 15 238 345 13 
Tables 618 8 7 1 71 7 64 681 63 
Number of families having 

Curtains and draperies* 144 26 23 + 39 23 19 156 12 
Sofas and couches 162 12 8 4 17 8 9 170 8 
Rugs and carpets 188 16 ll 7 32 ll 21 202 “4 
Tables 197 + 3 1 34 3 32 208 il 


1 Number of curtain and drapery panels. 
1 13 schedules excluded. 
in housing or furnishings affecting substantial numbers of families.* 

Further indications of the tendency to understate removals can be found in 
such facts as that the greatest increases occurred in the items of smallest size 
and value, curtains and draperies, and tables. Another clue is furnished by the 
relationships between age distributions in the inventory and the estimate of 
service-life expectancy. For example, the age distributions of the two items, 
sofas and couches, and tables are very similar (fig. 1), but the service-life ex- 
pectancy is quite dissimilar. Unless the similarity in the age distributions is a 
passing phenomenon, and there is no reason to believe that it is, the estimates 
of service-life expectancy should be more similar. 


* It is possible that some small! part of this error is the result of improper presentation of the questicn by the 
interviewer or incorrect interpretation of it by the respondent. The interviewer may have rephrased the question 
“Did you take out of your living room .. . ” to “Did you discard ... ,” or the respondent may have equated “take 
out of the living room” with “discard.” In either case, items moved to other parts of the dweiling would not be 
reported as removals. This error could occur only in a survey limited to a part of the dwelling. 

* Buch long-term style changes do lend some support to the two largest increases, in curtains and draperies, 
and tables but because of the small number of families involved, cannot be considered to explain the full increase. 
The newer housing, being for the most part detached units, tends to have more windows to be curtained. Also 
many of the newer houses and apartments have no ceiling lighting fixtures in the living rooms, which fact taken 
in conjunction with the style trend from floor to table lamps, supports some real increase in the number of tables, 
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Fic. 1. Age distribution of tables and of sofas and couches in the inventory, 210 
households in the Wilmington urbanized area within Delaware, 1954. 


In both studies there were failures to include the age of the item in reports 
on the inventory and removals. The handling of items of unknown age poses a 
problem because there can be no question but that the proportion so reported 
increases with the real age of the item and because there are relatively more 
failures to report age on removal than age in the inventory. These biases in 
combination, if uncorrected, result in an overstatement of the service-life ex- 
pectancy by reducing the age-specific removal rates at all ages but particularly 
at the upper ages. Distribution of the unknowns in proportion to those of 
known age results in an overcorrection, since the age-specific removal rates at 
the lower ages will then be too high. A distribution of the unknowns skewed to 
the higher ages is indicated. In Table 1 and in calculating the service-life ex- 
pectancies shown in Table 2, items of unknown age have been dropped because 
there are no data to determine the proper skewness of their distribution. The 
assumption that in each distribution 10 per cent of the unknowns belong in the 
first quartile, 20 per cent in the second quartile, 30 per cent in the third quartile, 
and 40 per cent in the fourth quartile reduces the estimate of the service-life 
expectancy of curtains and draperies approximately 10 per cent. The same 
assumptions would effect smaller reductions in other items because curtains 
and draperies showed one of the highest proportions of items of unknown age 
and the greatest disproportion between the number of unknowns in the two 
distributions. 

An evaluation of the accuracy of the age data would be valuable since these 
are the fundamental data on which the actuarial method is based and there is 
little experience with their collection in surveys. Unfortunately such an evalua- 
tion is difficult. It is obvious from the peaking at quinquennial and decennial 
years in the age distributions that there was a tendency to report age in round 
numbers. Whether there were other biases is harder to determine. Trade sales 
data indicate that the age distributions obtained in the two surveys are within 
reason, but the relationships are of necessity very crude. 


Tables 
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Errors in the reporting of age become important only if they introduce a 
systematic bias, or if they tend to be more frequent in one distribution than in 
the other. Rounding, if there is an equal tendency to round up and down, will 
have little effect on the estimate of service-life expectancy. 


USEFULNESS OF THE TECHNIQUE 


On the basis of the data obtained in the two surveys reported on above, 
the indications are that the actuarial technique can be applied as well to houso- 
hold goods as to industrial inventories. There are certain problems in its appli- 
cation, however. 

In the first place, the design of the schedule and the training of the inter- 
viewers must be improved over the levels attained in the studies reported on. 
Especially when the questions on removals cannot be tied to affirmative answers 
on acquisitions, techniques should be sought that will aid the respondent in 
reconstructing the situation at the beginning of the schedule period and the 
interviewer in keeping in mind recent purchases which might indicate related 
removals. 

The indications are that if the data being studied are of the type that lend 
themselves to the simpler schedule form used in the Kansas survey, the collec- 
tion of data for the computation of service-life expectancy rates can be com- 
bined with the collection of related data in a single survey. The schedule form 
used in Wilmington introduces a more complex interviewing situation, and to 
keep reporting error within acceptable limits it seems necessary that these data 
be the sole, or at least the primary, focus of attention. 

Useful as service-life expectancies of household goods would be, it would 
appear that the large samples needed would make them prohibitively expensive 
except as the data can be collected in conjunction with other purposes. For 
example, on the basis of tr= Wilmington experience, the sample size needed, if 
allowable difference is limited to one year at the 95 per cent level of probability, 
would be 300 households for living room curtains and draperies, 6300 for sofas 
and couches, and 7300 for living room rugs and carpets.° 

In neither study was an attempt made to obtain data on the type of item 
that is usually owned in quantity and whose units have little or no individual- 
ity. It seems self-evident, however, that there would be considerable difficulty 
in applying the actuarial method to such items as sheets or towels. An accurate 
report on the number discarded and the ages of items in the inventory or dis- 
carded would be impossible to obtain. To apply the method to this kind of item 
would require the keeping of records, and the interpretation of data from those 
who are willing and able to keep such records presents additional problems.'° 
Better estimates of the average service life of housefurnishings of this type can 
undoubtedly be obtained by the use of either of two other techniques, computa- 


* The formula for the standard error used in estimating sample size is that shown in the article by J. O. Irwin‘ 
“The standard error of an estimate of expectation of life, with special reference to expectation of tumourless life 
in experiments with mice,” Journal of Hygiene, vol. 47 (1949), 188-89. Irwin's formula is for a different kind of 
sample, but it has been adopted here as a rough approximation, probably underestimating the correct values. 

1° Pennock, Jean L., Brew, Margaret L., and Tillinghast, Rose C.; Farm Family Spending and Saving in Illi- 
nois, with a Comparison of Survey and Accounts Data, Agriculture Information Bulletin 101, U. 8. Department of 
Agriculture, 1954. 
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tion from the inventory-acquisition ratio," and from a distribution of the dates 
of last purchase.” If inventories are maintained at a constant level and there is 
regularity in the pattern of acquisition, these methods will give the same results 
as the actuarial method. While they tend to give a distorted estimate wher 
the level of inventory is changing,” it is probable that for many of the rela- 
tively minor items of housefurnishings this distortion would be less than that 
caused by the gross inaccuracy in the age data needed for the life table. The 
actuarial method, however, seems to be the method of choice for the many 
large and relatively expensive items that make up the bulk of the household 
inventory. 


4 Brew, Margaret L., O'Leary, Roxanne R., and Dean, Lucile C.; Family Clothing Inventories and Purchases, 
with an Analysis to Show Factors Affecting Consumption, Agriculture Information Bulletin 148, U. 8. Department 
of Agriculture, 1956. 

12 Stock, J. Stevens, “Non-Sampling Errors in Sampling for Business” in Business Applications of Statistical 
Sampling Methods, Allerton Park Conference, May 25 and 26, 1950. University of Illinois in cooperation with the 
American Statistical Association. 

4 To iliustrate from the extreme example used earlier, under the conditions of rapid change encountered in 
western Kansas in 1949 the life table gives an estimate of service-life expectancy for refrigerators of 9.2 years but 
the inventory-acquisition ratio indicates an average life span of 3.4 years. 


AN APPLICATION OF MULTIVARIATE QUALITY CONTROL 
TO PHOTOGRAPHIC PROCESSING 


J. Epwarp AND H. Morris 
Eastman Kodak Company 


There are a large number (p) of correlated variables to be controlled 
in a photographic process. Independent control charts for each would 
lead to a considerable misstatement of the Type I error for the system 
as a whole. The use of Hotelling’s 7’ may involve computational prob- 
lems if the determinant is near zero. Since this would imply that the rank 
of the matrix is less then p, k <p linear combinations of the original 
variates are obtained by the method of principal components. These 
new variates are used both for short and long range control with some 
simplifications when the short range procedures are to be handled by 
production personnel. 


INTRODUCTION 


HE production of a color photograph in the form of a color transparency 

can be considered as consisting of three major phases. These three phases 
are: the manufacture of the light-sensitive film, the exposure of the film in a 
camera, and the chemical processing of the exposed film. All of the variables 
that enter into the three phases of color photography contribute to the appear- 
ance of the finished picture. In particular, the appearance of the final result is 
quite sensitive to changes in the chemical composition of the processing solu- 
tions. Further, nonnegligible variabilities in these chemical compositions are 
introduced not only by such things as variability of weighing and mixing solutes 
and of replenishment rates, but also by variations in the subject matter of the 
pictures being processed. 

The problem of statistical quality control of processing solutions is to dif- 
ferentiate (with known risks) between observed variations that are consistent 
and those that are inconsistent with the inherent variability of the process. 
Process variations are observed in two ways. First, clemical analyses of the 
processing solutions are made wherever practical. This approach leaves much 
to be desired because few analytical methods are sufficiently quick, inexpen- 
sive, and sensitive. The second method consists of processing a film strip that 
has received a set of precise standard exposures. Optical densities of the proc- 
essed strip are measured and compared with standard values. This method is 
cheap, relatively rapid, and quite sensitive. Its chief disadvantage lies in the 
difficulty of interpreting the results. 

The most common procedure for a photographis check on the process is as 
follows: the test strip is given a graded series of exposures to white light and is 
processed. Optical densities of the resultant “steps” are then measured through 
red, green, and blue filters having narrow transmission bands. These “red,” 

“green,” and “blue” densities are then plotted as functions of exposure (see 
Fig. 1). 
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Fia. 1. Typical characteristic curves of a color product. Red, Green, and Blue 
integral densities are plotted against logarithm of exposure. 


Various suggestions have been made from time to time on how to treat these 
data. One technique consists of comparing the plots visually with an overlay 
of standard curves. This, of course, is imprecise and highly subjective. Another 
technique consists of determining slopes and intercepts of straight lines drawn 
through the curves. This too is imprecise and subjective because of the non- 
linear form of the curves. Neither of these methods is adaptable to the inclusion 
of “steps” exposed to colored rather than white light. 

A third method consists of plotting each density measurement (or a subset 
of them) on a separate control chart constructed by conventional methods. 
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This method is objective and is easily adapted to any number or type of 
“steps.” Its disadvantage is that it gives more than one answer to the question 
“Is the process in control?” Further, if each set of control limits is set for a par- 
ticular value, alpha, for the probability of a type I error, what is the proba- 
bility (on the null hypothesis) that the set of p points obtained from a given 
control strip are all within limits? It is certainly not alpha. 

The problem is thus to develop a control chart technique that will be ap- 
plicable to several variates simultaneously, assuming these variates to have a 
multivariate normal distribution that can be set at any desired probability for 
a type I error. 


APPLICATION OF VARIOUS MULTIVARIATE TECHNIQUES 


One approach to this problem would be to assume that the p variables are 
mutually independent. They may then be standardized so that, on the null 
hypothesis, each is normally distributed with zero mean and unit variance. 
Using sample estimates of the population parameters for the standarization, 
the sum of their squares is then asymptotically distributed like chi-square, and 
a single control chart can be easily constructed for this sum of squares. Al- 
though this technique has been successfully applied to other situations, it does 
not work at all with this problem. The reason for its failure is, of course, that the 
p variables are not independent. They are, in fact, very strongly correlated. 

This circumstance suggests the use of Hotelling’s T? as the appropriate 
statistic for control chart use [1]. This statistic is computed from the equation 


T? = (x — — x)’ = dS—d’ 


where z is the row vector whose elements are the observed values of the p 
variables, zo is the corresponding row vector of standard or null hypothesis 
values of the p variables, and where S is the sample covariance matrix of the p 
variables [3] [6]. The prime denotes matrix transposition. The use of this 
statistic gave rise to two difficulties. The determinant of the covariance matrix 
was found to be very small so that the computation of the inverse was inac- 
curate, and it was found that certain variations which were apparently sta- 
tistically significant were completely negligible as far as the visual appearance 
of pictures was concerned. The smallness of the determinant of the covariance 
matrix indicated that the variates were so strongly correlated that some of 
them could be dropped from consideration. This, however, the process engineers 
were loath to do on the basis that this correlation might not hold for large 
process variations; so still another approach was tried. 

The final approach consisted of applying another of Hotelling’s develop- 
ments, that of principal components [2]. A new set of k variates (k<p) is 
obtained by means of a linear transformation of the original variates. Since 
the transformation is linear, the multivariate normality of the new variates 
follows from that assumed for the original variates. 

Mathematically, the transformation may be written 


y = (x — 
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where y is the row vector whose elements are the new k variates and where W 
is a pXk rectangular matrix whose columns are the properly normalized char- 
acteristic vectors corresponding to the first k largest characteristic roots of the 
sample covariance matrix S. 

That is 


SW =WA 


where /\ is a diagonal matrix whose diagonal elements are the first k character- 
istic roots of S. The normalization is such that 


WW = 


Then W/SW=W'WA=A-"'A =I, a unit matrix. Thus the k new variates are 
normal, have zero means and unit variances on a sample basis, and are inde- 
pendent again on a sample basis, since the covariances are all zero. The sum of 
squares of these new variates therefore has an asymptotic chi-square distribu- 
tion with k degrees of freedom. That is, it approaches the chi-square distribu- 
tion as the sample covariance matrix approaches its population value. 

The exact distribution of this sum of squares is the T? distribution since the 
sum of squares can be written 


T? = yy = 


where the unit matrix, J, is the sample covariance matrix of the y’s. The per- 
centage points of the 7? distribution can be found through the use of the F 
distribution as indicated by Hotelling [1]. As in the case of the original p-vari- 
ate T? solution, a single control chart with a prescribed alpha can be con- 
structed for yy’. The degeneracy or near degeneracy of the matrix S has been 
allowed for by using a W with only k columns. That is, if one or more linear 
relationships are satisfied or very nearly satisfied by the original variates, it 
may be expedient or even necessary to use fewer than p new variates. 

The variability unexplained by the first k characteristic vectors should also 
be taken into consideration. This residual is equal to dS~'d’—yy’ or also to 
Yratyeset +++ +y,? if the latter quantities had been obtained. Hither of 
these expressions involves a great deal of computation and, if p is large, might 
be out of the question altogether. No use can be made of the residual covariance 
matrix resulting from the extraction of the k vectors as it is singular and cannot 
be inverted. For the specific problem discussed below, the assumption was 
made that the individual residuals were, for all practical purposes, uncorre- 
lated. (This was verified for this particular problem by actually computing 
the true residuals. However this assumption might not be valid in some other 
instances.) Faced with the choice of an approximate control tool or none at all, 
the following check on the residual was employed: 


Residual Sum of Square = 


Trace (S) — Trace (/\) 
This is approximately distributed like x? with p—k degrees of freedom. 
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ILLUSTRATIVE EXAMPLE 

To evaluate the quality of pictures produced by a photographic process the 
density is measured of selected steps of a test strip such as that shown in 
Fig. 1. The steps are selected to keep the data to a comprehensible minimum 
and yet to describe adequately the factors most likely to influence picture 
quality. Experience has shown that measurements of one step at the high 
density portion of the characteristic curve (shadow areas), one step in the 
middle-tone portion of the curve (average picture density), and one step in the 
toe or low density portion of the curve (highlights, fleshtones, whites, etc.) are 
sufficient. There are then nine measurements in all, three density levels and 
three colors at each level. 

Photographic control of these processes is carried on in two ways. One is the 
so-called “short-range” control wherein a sample strip comes off each processing 
machine once every half hour. Observations on these strips are made by control 
men at the processing machine; they make any necessary adjustments accord- 
ingly. In the past the control men have had to draw conclusions from nine 
separate measurements and infer the status of the process as regards contrast, 
color balance, etc. In addition, a “long-range” control system is employed by 
technical personnel who process one strip at a random time each day—a sort 
of sampling inspection of a quality control operation. These data are followed 
by the technical personnel for the purpose of studying long-term trends in the 
process. It is on the long-range data that these multivariate investigations were 
first carried out, and from which this example has been taken. 

The rational subgroup in this case was selected as one week. The machines 
run fairly continuously from the time they start up on Monday morning until 
they are shut down Friday or Saturday night. This Monday morning start-up 
can be quite complicated, as there are over one hundred chemical variables 
alone. It is very difficult to exactly match one week’s level of operation the 
following week, but once the process has attained equilibrium for the week it 
is fairly stable about that level. There are, then, two components of variance— 
the inherent variability within weeks and the variability between weeks; the 
latter has an assignable cause, the Monday morning start-up. It is the current 
aim to control the process within the limits which it may be expected to vary 
within a week. 

The base period for this analysis consisted of 31 weeks. The within-week 
covariance matrix was computed with 108 degrees of freedom. (Some data were 
missing.) This represents the average within-week variability to which the 
process should conform. The matrix, sufficiently coded to give all integers, is 
shown in Table I. (Elements given by symmetry are not shown.) It can be seen 
that most of the covariances are positive and that the variances change quite a 
bit from one portion of the curve to another. The corresponding correlation 
matrix is shown in Table II. It is to be noted that high positive correlations 
exist between many of these measurements. The shoulder and middle-tone 
measurements are quite well correlated with each other, but the toe measure- 
ments are somewhat independent of the others. After obtaining several char- 
acteristic vectors and examining the resultant residuals, it was decided that 
five vectors would sufficiently describe the variability of the process. They are 
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TABLE I 
WITHIN-WEEK COVARIANCE MATRIX (CODED) 
Shoulder Middle-Tone Toe 
Red Green Blue Red Green Blue Red Green Blue 
177 179 95 96 53 32 -—7 —4 -3 
419 245 131 181 127 —2 1 4 
302 60 109 142 4 4 ll 
158 102 42 4 3 2 
137 96 4 5 6 
128 2 2 8 
34 31 33 
39 39 
48 
TABLE II 
WITHIN-WEEK CORRELATION MATRIX 
Shoulder Middle-Tone Toe 
Red Green Blue Red Green Blue Red Green Blue 
1 .66 .41 .57 .34 -21 — .09 — .05 — .03 
1 .69 -51 -75 — .02 -03 
1 .27 .54 .04 .04 .09 
1 .69 .30 .06 .04 .02 
1 .06 .06 .07 
1 .02 .02 -10 
1 .85 .81 
1 .89 
1 
TABLE III 


CHARACTERISTIC VECTORS OBTAINED FROM COVARIANCE MATRIX 


Shoulder 


Middle-Tone 


Red Green Blue 


|| 

Toe 
| Green’ Blue | Red Green Blue 
3.2 7.0. 5.2 2.8 3.5 2.9 0 | 
V2/d2 -11.0 — 3.4 18.3} -—11.1 .8 8.4 1.3 1.2 2.0 
11.5 5.2. 6.5] —-12.9 -13.8 -7.6|) -—7.0 -—7.2 -—7.7 
8.6 1.3 3.6) — 3.1 — 8.6 —7.4 13.8 15.8 17.3 
Vs/As 12.7 —22.8 15.0 16.0 — 5.8 §.5| -1.8 -2.5 —1.9 
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shown in Table III where each vector has been divided by its own character- 
istic root. This transformation on the original variables results in a set of 
variables which have zero mean and unit variance. The properties of these 
vectors are as follows: 


y: is essentially a density shift in the shoulder and middle-tone portions of 
the curve; the coefficients are all positive. 


y2 is, in photographic parlance, a warm-cold color-balance shift. Excessive 
blue density in the picture will result in the picture being too yellow, which 
is considered to be a “warm” color. The lack of blue density will produce a 
blue or “cold” picture. Excessive red density will produce a blue-green pic- 
ture, which is also considered “cold,” and lack of red density will produce a 
red or “warm” picture. This vector is essentially the relationship between 
the red and blue densities, independent of the green. Again this is in the 
shoulder and middle-tone portions of the curve. 


ys has positive coefficients in the shoulder and negative in the middle-tone 
and toe. This describes the photographic contrast. High contrast results in 
darker shadow or lighter highlights so that much detail is lost in the picture. 


ys has positive coefficients for the toe and shoulder and negative in the 
middle-tone. Although this appears to be a buckle in the curve, it represents 
more nearly a density shift in the toe. 


ys represents a green-magenta color-balance shift relative to variations in the 
other colors. This, then, is another color-balance parameter which is orthog- 
onal to the warm-cold one. 


The contributions of each variable to each of these vectors are shown in 
Table IV; this illustrates why y, was considered to be essentially a toe shift. 
Unfortunately, the photographic importance of each vector is not porportional 
to its length. An over-all density shift is not nearly as noticeable as the same 
amount of shift in one color alone. Photographically, the fifth vector is prob- 
ably the most important of the five as far as this particular process is concerned. 


TABLE IV 
PROPORTION OF COVARIANCE MATRIX EXPLAINED BY EACH VECTOR 


. Percentages 
Trace 
of Shoulder Middle-Tone Toe 
Matrix 

Red Green Blue | Red Green Blue | Red Green Blue 
Vi 61 46 90 68 38 67 51 0 0 0 
V2 14 26 1 22 30 0 21 2 2 3 
V; 9 12 1 2 17 23 8 24 22 20 
Vs 7 5 0 1 1 6 4 59 68 67 
Vs 6 6 & 5 11 2 2 0 1 0 
Unexplained 3 5 0 2 3 2 14 15 7 10 
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Fia. 2. Process control chart for original optical densities. 3c limits based on 
within-week variability. 
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The decision of how many vectors to use is somewhat arbitrary but generally 
enough will be used so that each individual residual variance will be reduced 
to a tolerable level. 

Now, how can these be used as a control tool? An eight-week period of opera- 
tion in terms of original densities is shown in Fig. 2. (This eight-week interval 
follows the base period.) The red and green shoulder lost density until the sixth 
week when they went out of control after which they behaved quite normally. 
The blue was more variable. The red and green middle-tone appear to be all 
right except for the sixth week. The blue is still quite variable. The toe density 
appears to be on the high side for all three colors, even during the sixth week, 
but is all right after that. 

The use of transformed variables yields the control charts shown in Fig. 3. 
These are no longer in order by size of their characteristic root but are grouped 
depending on whether they are a density or color type of measurement. As has 
already been noted, the process got a little lighter until the sixth week when it 
went out of control. The contrast appears to have been quite a bit lower than 
standard a couple of times and the toe density, as mentioned, ran on the high 
side for the first period. The process runs intermittently green and magenta 
with green predominating and seems to trend from warm to cold about once a 
month. This would be hard to infer from the original variables. It agrees gen- 
erally with the original variables in that when one set is out of control the other 
is and vice versa. However, it is much easier to see the nature of out of control 
conditions this way. These parameters reflect, much more readily, chemical and 
mechanical changes to the process. 

In addition, an over-all measure of control is desirable. The sums of squares 
of the y’s (when there are this many degrees of freedom) is approximately dis- 
tributed as x? with five degrees of freedom. In addition, the residual sum of 
squares 


(p — k)(dd’—yAy’) 
Trace (S) — Trace (/\) 


as stated above, is distributed approximately as x? with four degrees of freedom. 
The two measures are independent so their sum is approximately distributed 
as x? with nine degrees of freedom. These three measures are shown in Fig. 4. 
If the residual sum of squares goes outside the limits, then some photographic 
variation other than the five being controlled should be investigated. If it goes 
out of control very often, then it should be concluded either that more vectors 
should be controlled or that the system has changed and that a new vector 
space should be found. In this case, the residual sum of squares is fairly well in 
control so it was felt that the five vectors were sufficient. 

The observations are all independently obtained and assumption of nor- 
mality is being made which might not necessarily be valid. Analogous to uni- 
variate control charts for averages and ranges, use should be made of some of 
the various 7’ statistics Hotelling has derived. In his bombsight paper in the 
book Techniques of Statistical Analysis, [3] he lists three of them: 74? which 
in this case represents the over-all variation within a week about the aim. If 
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Fig. 4. T? control chart. 99.7% limits based on within-week variability. 


one check a day is processed, the weekly 7? should be distributed approxi- 
mately like x? with 45 degrees of freedom. This measure can be broken down 
into two other statistics representing level and dispersion. 7'y? represents the 
generalized distance between the weekly means and the aim and is distributed 
approximately like x? with nine degrees of freedom. T'p? represents the variabil- 
ity about the weekly mean and is distributed approximately like x? with 36 de- 
grees of freedom. Hotelling has since shown that the convergence of 7'o* and 
Tp? to x? is quite slow even for moderate sample sizes [4]. However, the exact 
distribution for the nine-variate case has not yet been derived. 

These 7” statistics are plotted in Fig. 5. First, looking at 7'o?, there was more 
variation than could have been expected within weeks and, by this yardstick, 
the process was out of control the sixth week. The varying limits are caused by 
the fact that the process was not running full time during this period, running 
anywhere from three to five days a week so that the sample size varied from 
week to week. A look at 7'y? will show that the means do not conform to stand- 
ard more than about half the time. 7’p? is in control. It can be seen that the 
trouble is with 7'*, not 7p’, verifying statistically the statement made earlier 
that the process is stable within a week, but that it is difficult to repeat the 
level from week to week. The aim, then, is to reduce 7'y* which has an assign- 
able, although at the present unknown, cause. 

This system is all very well and good for long-range work but for short-range 
control, where the control man has a strip ¢oming off every 30 minutes from 
each of as many as a dozen machines, something simpler must be used. There- 
fore, simple approximations must be made which correlate well with each of the 
original vectors, are relatively independent of each other, and can be computed 
without the aid of a desk calculator. Returning to Table III for the moment, 
it can be seen that the first characteristic vector, representing optical density, 
is essentially a weighted sum of the nine original measurements. These weights 
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for the toe density are very small and could probably be disregarded. The other 
coefficients are roughly of the same magnitude. For short range work, it is 
practical simply to use the sum of the middle-tone and shoulder. Over the base 
pericd, a measurement of this type showed a correlation of 0.99 with the first 
vector. By similar approximations, the following transformations were ob- 
tained: (This is called the C.D.C. system which stands for Color-Density- 
Contrast: R, G, and B refer to Red, Green, and Blue densities; S, M, and T 
refer to Shoulder, Middle-Tone, and Toe densities.) 


Density (corr. with y, =.99) =Rs+Gst+Bst+Rut+Gut+Bu 

Contrast (corr. with y;=.75) =(Rs+Gs+Bs) —(Ru+Gut+Bwm) 

Toe (corr. with y.=.85) =Rr+Gr+Br 

Green-Magenta (corr. with ys = .78) = 2(Gs+Gmu) —(Bs+But+Rs+Rm) 
Warm-Cold (corr. with yz:=.97) =(Bs+Byu) —(Rs+Ru) 


There is no doubt that these are easy to compute. It is important that these 
approximations be fairly independent of each other. It is all right to get simple 
approximations but if they are all correlated, then the situation is little or no 
better than with the original nine variables. In this case only two of all the 
possible correlation coefficients between them are significant. Fig. 6 shows the 
relationship for the eight weeks’ data between the two measures of green- 
magenta variation which was about as poor as any pair. The negative correla- 
tion is the result of the manner in which the C.D.C. parameter was chosen. 
Better approximations to these vectors could have been obtained but these 
have the distinct advantages of being both easy to calculate and fairly inde- 
pendent of each other. The same eight weeks’ data by this system are shown in 
Fig. 7. These show just about the same process characteristics as did the 
principal components. Variations unexplained by these five variates are de- 
tected both by the long range control program and by the plotting once an 
8-hour trick of all of the data from one selected test strip. 
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MAXIMUM LIKELIHOOD ESTIMATES FOR A MULTIVARIATE 
NORMAL DISTRIBUTION WHEN SOME 
OBSERVATIONS ARE MISSING* 


T. W. ANDERSON 
Columbia University 


EVERAL authors recently have derived maximum likelihood estimates of 

parameters of multivariate normal distributions in cases where some ob- 
servations are missing (Edgett [2] and Lord [3], [4]). The purpose of this note 
is to give an approach to these problems that indicates the estimates with a 
minimum of mathematical manipulation; this approach can easily be applied 
to other cases. (The technique bears some resemblance to that of Cochran and 
Bliss in a different problem [1 ].) 

The method will be indicated by treating the simplest case involving a bi- 
variate normal distribution. Suppose z and y have a bivariate normal distribu- 
tion with means y, and y,, variances ¢,” and o,? and correlation coefficient p. 
We shall indicate the density by n(z, y| Mz, My; 2", oy’; p). Suppose n observa- 
tions are made on the pair (z, y) and N—n observations are made on x; that 
is, N—n observations on y are missing. The data are 


* * * * * TN, 
Yn 


The problem is to estimate the means, variances, and correlation by the method 
of maximum likelihood. 

We note that the bivariate density of z and y can be written as the marginal 
density of x times the conditional density of y given z; that is 


n(z, y| Bz, My; oy"; p) = | Hz, o)n(y | v + Oy-2"), (2) 


(1) 


where 
v = py — 
= poy/ oz, (3) 
= — p?). 


The likelihood function can be written 


n N 
Il N(La, Ya | Maz, My} p) Il Mz, 
a=1 a=n+1 


(4) 
N n 

II n(La| Mz, 72”) II n(Ya | + Bys%a, Fy-2"). 
a=l a=1 

The maximum likelihood estimates of uz, 02”, v, Byz, and cy.,” are those values 


that maximize (4). To maximize (4) with respect to u, and o,” we maximize 
us, 02”); this procedure gives us the usual maximum likelihood esti- 
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mates of the parameters of a univariate normal distribution based on N ob- 
servations, namely, 


N 


a=l 


= (ta — 


a=l 


(5) 


To maximize (4) with respect to v, Byz, and ys”, we maximize the second term 
of the right hand side of (4). This gives the usual estimates of regression pa- 
rameters, namely, 


(ve — — 


(te 2") 
v= — 


a=l 
The maximum likelihood estimates of y,, o,?, and p are obtained by solving 
(3), where v=%, Bye=Bye and oy.2?=6y.2". (The argument is that if any other 
values of uz, uy, 72”, oy7, and p gave a higher value to the left hand side of (4), 
they would give that higher value to the right hand side with v, Byz and oy.,” 
given by (3) according to these values of uz, uy, oz”, oy? and p.) Thus 


= + p's,’ = + 

= 


It will be noted that Z, is the so-called regression estimate in double sampling. 

Now let us see how this method applies to the problems solved by Edgett 
and Lord. In Edgett’s case there are three variables w, z, and y with a trivariate 
normal distribution. The data are 


Wi, *, Wn, Way, *, WN, 
Ti, * * * » (8) 
Yiy*** Yn 


The difference between the preceding case and this one is that z has been re- 
placed by the pair (w, z). Since the same approach is used, we will only sketch 
the procedure. The likelihood function is 


202 AMERICAN STATISTICAL ASSOCIATION JOURNAL, JUNE 1957 


Il n(Wa, Ta; Ya| Hw, ey By; Fw", os", Pwr, Pry) (9) 
a=l 


N 
n(wa, He; Tw", 72"; pws) 
a=n+1 


N n 
II n(Wa, La | Mw, Me; Ow’, II n(Ya| + Byw-zWa + Bys-wLa, Cy-ws); 
a=1 


where 
by — — Byz-whte, 
Byw-2 + = (10) 
FwF + = 
= Oy? — + 2B + 


The maximum likelihood estimates of 42, 27, aNd pws are computed in 
the usual manner from the N observations on (w, xz). The estimates of the 
parameters of the conditional distribution of y given (w, xz) are computed from 
the n observations on y and (w, z). Then jy, @,”, Duy, and fy are found from 
(10) with the other parameters replaced by the estimates. These results agree 
with Edgett’s. 

A problem treated by Lord involves three variables, say z, y and z, where 
z and y have a bivariate normal distribution and z and z have a bivariate nor- 
mal distribution. The data are 


* Yny (11) 
Zeit) °°» 
Here the likelihood function is 


nm N 
Il n(Za, Ya| May My; Tz", Fy"; Pry) II N(La; May Ms; Tx", Os"; 
a=n+l (12) 


= [J ue, II (Ya| ¥y + Bysta, + 
a=l a=] a=n+1 


As in the first case, u, and o,* are estimated from the N observations on z, 
and yy, cy? and psy are estimated by using the first n observations to estimate 
vy, Bye and Similarly o,* and pz, are estimated by using the last N —n 
observations to estimate v,, 8.2 and ¢,..2. Lord [3] gives these explicitly. (py. is 
not defined and is not estimable.) 

Other problems of missing observations (but not all) can be handled this 
way. For example, if the observations form the pattern 


* * Tatty * * * * * TNs 
Yi, * * * Yny Yntty * * Yntm, (13) 
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or the pattern 
Uy ** * Uny 
Unit, * * Unimy 


the method can be applied. 

Each case can be generalized to a multivariate (or vector) case. As an ex- 
ample, consider the first case where z’=(m,--+, and 
where the prime denotes a transpose. Let yu, be the (column) vector of means 
of z, uy the ve¢tor of means of y, ) 22 the matrix of variances and covariances 
of z, the \corresponding matrix for y and the matrix of covariances 
between z and\y. The generalizations of (5) are 


1 
= ta, 
aml 
(15) 
N 


For (6) we have 

n n -1 

a=] 

b= — (16) 
1 n n 

(ya — 5*)(Ya — §*)’ — B > (fe — 2°) (te — 

n a=l a=l 


and for (7) we have 
uy = 9* + Bz — 2*), 
Le = La! = 
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THE ANALYSIS OF INCOMPLETE BLOCK DESIGNS 


MarvIN ZELEN 
National Bureau of Standards 


Under the usual assumptions for the recovery of inter-block in- 
formation there may exist two independent variance-ratio tests for 
testing every null hypothesis pertaining to the treatments. A method 
is suggested for combining two independent tests, and an example 
shows the gain in power from using the second test. Exact confidence 
limits are given for the ratio of the inter-block variance to the intra- 
block variance. 


1. INTRODUCTION AND SUMMARY 


We incomplete block designs are used for procuring experimental data, 
it may be possible to make two independent analyses on the data. The 
usual analysis, termed the intra-block analysis, depends only on comparisons 
within the block, whereas the second analysis, termed the inter-block analysis, 
makes use of the block totals only. This latter analysis is sometimes referred 
to as “the recovery of block information” and depends on the assumption that, 
the block effects are random variables. 

Yates first showed that an inter-block analysis can be made for three di- 
mensional lattices [13] and balanced incomplete block designs [14]. Nair [9] 
and Rac [10] have given the general formulas which apply to any incomplete 
block design. More recently Bose and Shimamoto [3] and Bose, Clatworthy, 
and Shrikhande [4] have considered intra- and inter-block analysis of par- 
tially balanced incomplete block designs with two associate classes. The books 
by Cochran and Cox [5], Federer [6], and Kempthorne [8] give useful sum- 
maries of the complete analysis for many types of experimental plans. 

The customary use of the inter-block information is to combine this addi- 
tional information with the intra-block analysis so as to estimate the treatment 
effects with greater precision than if the intra-block estimates had been used 
alone. 

This paper points out that there exists a different type of inter-block analysis 
than that given in all the references above. If this new inter-block analysis is 
carried out, there will exist, under certain assumptions, two independent sta- 
tistical tests for testing every null hypothesis about the treatments.' Further- 
more, weights for combining the intra- and inter-block treatment estimates 
can be obtained which are statistically independent. 


2. NOTATION AND INTRA-BLOCK ANALYSIS 


Consider an incomplete block design having v treatments arranged in b 
blocks, such that each block contains k (v>k) experimental units, and each 
treatment occurs in r blocks. Let y;; represent the measurement of the i-th 
treatment in the j-th block. Furthermore, assume that the mathematical 
model underlying the measurements can be written as 


1 In an informal discussion, H. O. Hartley informed me that he also has obtained a similar result. 
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t 
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=ptti +b; + (i= 4,2, 1, 2, - 6) (2.1) 


where y;; is only defined when the i-th treatment occurs in the j-th block. 
The parameter » represents a constant effect common to all measurements, 
and t; and b; denote the constant effects of the i-th treatment and j-th block 
respectively. In order to obtain the minimum variance unbiased estimates of 
the unknown parameters, it is only necessary to assume that the {¢,;} are a 
sequence of uncorrelated random variables having a mean zero and (unknown) 
variance o?. However, in making all tests of significance we shall further assume 
that the {¢;;} follow a normal distribution. 


Let 
‘ if the i-th treatment appears in the j-th block 
ny = 
0 otherwise 
and define 

Y;. 

ye =— 

j r 

Y.; 

Y.. 

Y..= 


where the index of summation 7 always runs from 1, 2,---, v and j runs 
from 1, 2,---,b. 
Then it is well known (Bose [2], Kempthorne [8 ]) that the normal equations 
which give the intra-block treatment estimates are 
r(k — 1) 
k 


= ¢=1,2,---,9 (2.3) 


where 
(adjusted treatment totals) (2.4) 
i 


and \y.= >.; nijn,; is the number of blocks in which treatments i and s appear 
together. Since the above normal equations are singular it is common to add 


the restraint 
=0 (2.5) 


whick will make the solutions of the ¢,’s unique. Then the treatment sum of 
squares can be written as 


S= df.=v-1. (2.6) 
The error sum of squares is 
8S. = Dawe? 8 d.f. =n, = bk-b-v+1 (2.7) 
j 
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and is an estimate of n,o’. 
The test for the null hypothesis that all treatment effects are zero, i.e., 
Hy (h=h= =t,=0), is the F-ratio, 
S,/(v — 1) 
S,/n, 


and a test for determining if a contrast between the treatments is equal to 
zero, i.e. 


F (2.8) 


= Di = 0 (Lao) 


can be made using Student’s ¢, e.g. 


1/2 
[ var | 


d.f. =n, (2.9) 


8. INTER-BLOCK ANALYSIS 


In many experimental situations it is reasonable to assume that the block 
effects {b;} are random variables, uncorrelated with each other and with the 
{ey}, having a mean zero and (unknown) variance o?; i.e. 


E(0;) = 0, var (bj) = 


cov (b;, = 0 forj (3.1) 
cov = 0. 
A block total will be of the form 
Y.; = ku t+ + + (3.2) 


Hence, the expected value and variance of Y ; are: 
E(Y.;) = kw + 
var (Y.;) = k(o? + kos’). 
Furthermore, the block totals Y ; are uncorrelated with the Q,; i.e. 


(3.3) 


1 
cov (Y.;, Q,) = cov (Y.;, ¥;.) — cov (Y.;, Ym) 


= nij(o? + — + = 0 


and thus are uncorrelated with the ?;. Therefore, a contrast among the block 
totals x’ = =0) estimates a linear function of the treatments; 
that is 
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Be’) = + = 


provided g;= are not simultaneously zero for all i=l, 2,---, »v. 
Hence, this estimate will be uncorrelated with the corresponding intra-block 
estimate #= >>; gif; for the same contrast. 

The relations (3.1) are all that is necessary to assume about the {b,} in 
order to use the block totals for obtaining other estimates of treatment con- 
trasts which are uncorrelated with the corresponding intra-block estimate. 
However, in making all tests of significance it will be assumed further that the 
{b;} follow a normal distribution. 

The normal equations for obtaining estimates of t; using only the block totals, 
can be obtained by differentiating the quadratic form 


j=l 


with respect to the parameters u and ¢; and equating the resulting expressions 
to zero. The normal equations, cf. Kempthorne [8, p. 544], can then be written 


1 
k k sri 
where 
Q,’ = TY.. = — Ty.. (3.6) 
3 


(A prime is used to denote quantities appearing in the inter-block analysis which 
have counter-parts in the intra-block analyses.) 
Summing equations (3.5) over all treatments gives 


as >.; Q;’=0. Therefore, the treatment estimates obtained from solving (3.5) 
will always satisfy the restraint 


>t’ = 0. 


Thus, from the b block totals it is desired to estimate the grand mean y, and 
the parameters t;(i=1, 2, - - - , v) of which only (v—-1) are independent. There- 
fore, in order to estimate all v parameters, the rank of the quadratic form (3.4) 
must be >v. The necessary and sufficient condition for this to hold is that there 
exist at least v block totals having expectations which are linearly independent. 

Let (m—1) denote the number of dependent linear relations among the ex:- 
pectations of the block totals. This implies there exist a set of (m—1) orthog- 
onal contrasts in Y.; of the form 
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= > h=1,2,---,m-—1 
i 


Lag =1, =0 hg fine 


such that Z(L,) =0. Then the quantity 


1 m—1 
S.'(m — 1) = Ly? (3.8) 


is an estimate of (m—1)(c?+ko,”) with m—1 degrees of freedom. 
Two cases have to be considered: (1) b—(m—1) >v and (II) b—(m—1) <v. 


Case I: b—(m—1)>v 


This case corresponds to the situation most often met in practice. The normal 
equations (3.5) will then enable one to solve uniquely for the ¢,’ and all (v—1) 
contrasts among the treatments will be estimable. The inter-block treatment 
estimates can then be written 


= (3.9) 


where ||c;.’|| (¢, s=1, 2, - - - , v) is the inverse matrix corresponding to the co- 
efficient matrix of the normal equations (3.5). 
Now by straightforward algebra we have 


r r 
var Q,’ = (o* + 
k v 
cov (Q,’, Q,’) = + kev’) *), 
Thus the variance of any inter-block contrast is 
var 7’ = var ( = 
= (0? + kos) 
s=1 
The sum of squares associated with treatments can be written 
> 
i 
and has the expected value 
1 2 
i 


The inter-block residual sum of squares 


(3.10) 
(3.11) 
(3.12) 
(3.13) 
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1 2 
j F 
is an estimate of (b—v)(o?+ko,”); in addition S,’/(¢7+ke,?) follows the chi- 


square distribution with n,’ = b—v degrees of freedom and is independent of S,. 
Alternatively the residual sum of squares can be written 


= ) Y.;2 — bky..2 — wav. (3.15) 


When m>1, the residual sum of squares can be further subdivided, i.e. 
S,’ = 8,'(m—1) + S,’(b—v—m+1) (3.16) 


where S,’(m—1) corresponds to (3.8) and S,’/(b—v—m-+1) is a remainder sum 
of squares having (b—v—m-+1) degrees of freedom. If the (m—1) relations 
(3.7) correspond to a grouping of the blocks into m replication groups (each 
grouping of blocks consists of one or more replicates of the treatments) then in 
some instances S,’(b—v—m+1)/(b—v—m-+1) may only be used for estimating 
(o*+ko,”). This would be the case if the variation between replicate groupings 
is significantly larger than the variations between blocks (within replicate 
groupings). 

In any event one can test the null hypothesis that all treatment effects are 
equal, e.g. Ho(i=h= -- + =t,=0), using the F-ratio 


S,’/(v — 1) 
Se /n,’ 
A test of a hypothesis that any contrast among the treatments is zero can also 
be made using the Student-t statistic corresponding to equation (9), e.g. 


j 


S. 
a? = — (3.18) 


with n,’ degrees of freedom. Furthermore, these inter-block tests of significance are 
statistically independent of the analogous tests made in the intra-block analysis. 
The results of the inter- and intra-block analyses can be arranged in the analysis 
of variance table shown in Table I. 

If m=1 and b=», it will only be pessible to obtain additional estimates of the 
treatments from the block totals. It will not be possible to use these inter-block 
estimates for making additional tests of hypothesis because no estimate of 
(o?+-ko,”), independent of ¢;’ is available. 

Case IIT: b—(m—1) <v 

If b—m+1<vp, then it is possible using only the inter-block information to 
estimate (b—m) contrasts among the treatments. In addition, an estimate of 
(o?+-ko,”) having (m—1) degrees of freedom can be calculated using (3.8). 
The particular treatment contrasts which are estimable will depend on the 
experimental design. 


F d.f. = (v — 1) and n,’. (3.17) 
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Corresponding to the treatment sum of squares is the quantity 


2 2 


oni bk 
having (b—m) degrees of freedom. The expected value of S,’’ is 


1 2 
The variance ratio 
S,/’/(b — m) 


~ S.(m — 1)/(m — 1) 


(3.21) 


having (b—m) and (m—1) degrees of freedom, tests the null hypothesis that all 
(b—m) estimable contrasts among the treatments are equal to zero. It is doubt- 
ful whether such a test is of any value. If m=1, no estimate of (o?+koy*) is 
available which is independent of ¢;’, and hence there can be no tests of signifi- 
cance using only the inter-block information. 

An example will serve to clarify the discussion. Below is the layout of a par- 
tially balanced design with parameters b=4, v=4, k=2, r=2. 


Blocks 


AX oles 
a ale 


The 4 blocks are so arranged that they form two single replications. It is ob- 
served that m—1=1 or m=2 as there is only one dependent relation among the 
expectations of the block totals, e.g. 


E(Y 1) +E(Y¥ 2) —E(Y¥ 1) —E(Y =(a+b) +(c+d) —(a+e) —(b+d) =0. 


Therefore b—m=2 and it is only possible to estimate two treatment contrasts. 
These two contrasts are given by (a—d) and (b—c) which can be easily verified 
using the normal equations (3.5). Of course any linear function of these con- 
trasts will also be estimable, i.e. (a—d)+(b—c), (a—d) —(b—c). However, there 
can be no estimate obtained from the inter-block information alone on contrasts 
of the form (a—b), (c—d), (a—c), and (b—d). 

The test of the hypothesis that the two estimable contrasts are simultane- 
ously equal to zero, i.e. 


Hy: (a -d =0,b—c = 0) 


is given by the F-ratio (3.21) having 2 and 1 degrees of freedom. 

If this basic experimental plan is replicated several times, the additional 
information can only be used in the inter-block analysis for estimating 
(o?+ko,*) with greater precision. There will still be only two estimable inter- 
block contrasts among the treatments. 
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Formulas for application to balanced incomplcte and partially balanced designs 
(two associate classes) 

Since the balanced incomplete and the partially balanced incomplete block 
designs are the two most important classes of incomplete block designs, their 
inter-block treatment estimates are recorded below. (These formulas also apply 
to the 2-dimensional lattice designs (v=k*) as the balanced lattice and the 
partially balanced lattice designs are respectively balanced and partially bal- 
anced (2 associate classes) incomplete block designs. 

With respect to the balanced incomplete block designs, the inter-block treat- 
ment estimates are 


Q,’ E v(k — 1) 
(i k(v — 1) 


where £ is the intra-block efficiency factor. The variance between two treat- 
ments is 


t,’ (3.22) 


+ kos?) 
(1 — E)r 


For partially balanced designs with two associate classes, the inter-block 
treatment estimates can be written 


+ e1’Si(Qi") + 


r 


var (t,’ — t,’) = (3.23) 


(3.24) 


where S,,(Q;’) denotes the sum of the Q”’s which are mth associates (m=1, 2) of 
treatment 7, and 


Cm — 
Cu war ee (m ) (3.25) 


(The parameters cn, A, \m, H are the usual parameters for partially balanced 

designs, cf. [3], [4].) The variance between two treatments which are mth 

associates (m=1, 2) is then 

2(k — Cm’)(o? + kas?) 
r 


var (t;’ — t,’) = (3.26) 


4, COMBINING INTRA- AND INTER-BLOCK TESTS OF SIGNIFICANCE 


Since for the case b—m+1>v there will always exist two independent F- 
ratios for testing any general linear hypothesis about the treatments, it seems 
desirable to combine the two statistical tests into a single test such that the 
power of the combined test might be greater than either of the individual tests. 
For this purpose let F; and F, be the two F-ratios obtained from the intra- and 
inter-block analyses respectively, with reference to a particular null hypothesis 
H>. Corresponding to each F-ratio there will be a probability P; defined by 


P,;=P{F>F;|Ho} i=1,2 (4.1) 


4 
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TABLE II 
CRITICAL VALUES OF C, FOR a=.05, .01 


6 a=.05 a= .01 
0.0 .050000 .010000 
0.1 .045000 .009000 
0.2 .040000 -008000 
0.3 .035004 .007000 
0.4 .030062 -006001 
0.5 025321 .005013 
0.6 020956 .004062 
0.7 017092 .003190 
0.8 013775 .002432 
0.9 010995 -001805 
1.0 008705 001309 


which is the probability of exceeding the observed F; if the null hypothesis is 
true. 

Since the intra-block test will always be more powerful than the inter-block 
test, it will be useful to have a procedure for combining the two independent 
tests such that the intra-block test is given greater weight. 

One simple way to weighi the two tests is to choose a weight @ and use the 
critical region 

w:{P,PY<C}, (4.2) 


Therefore, the null hypothesis is rejected with significance level a if P;\P°<C. 
where C,, is chosen such that 


P{P,P# < Cx} = a (if the null hypothesis is true). 


Table II contains critical values of C, for a=.01, .05 and @=0(.1)1.0. It should 
be noted that the combined test given by the critical region (4.2) makes use of 
the probability integral transformation (4.1) and is not an explicit function of 
the adjusted treatment estimates. 

The weight @ is an appropriately chosen weight function which weights the 
inter-block test relative to the intra-block test. Note that if @=0, only the intra- 
block test is used; if @=1 this is equivalent to giving both tests equal weight 
and actually corresponds to the well known method of R. A. Fisher [7] for 
combining two independent tests of significance. (See Wallis [12], Birnbaum 
f1}). 

One possible weighting function is to let 


where E£ is the efficiency factor for the intra-block analysis. ({1—£] is the ef- 
ficiency factor of the inter-block analysis.) Then as p=?/o? becomes large, 


(4.3) 
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the weight @ will approach zero, resulting in only using the intra-block test. 
An investigation of the power of such a combined test is given in [15|. The 
calculations of the power function in [15] indicate that (4.3) gives an “opti- 
mum” weighting function. 


5. COMPUTATIONS ILLUSTRATING THE POWER OF THE COMBINED TEST 


In this section we shall compute the power of the combined test, { P:P.’<C} 
for a particular balanced incomplete block design with respect to the null 
hypothesis that all treatments are equal, i.e. 


+--+ =t, = 0) 
against the alternative hypothesis 
t; — 
i 


For a balanced incomplete block design, the expected values of the mean 
squares in the analysis of variance table corresponding to Table I can be 
written in the form of Table IIT. 

TABLE III 


ANALYSIS OF VARIANCE FOR BALANCED INCOMPLETE 
BLOCK DESIGNS 


Source Degrees of freedom Expected value of mean square 
Blocks (unadjusted) b-1 
Z(t, — t)* 
(adjusted) + koy? + a E)r 
inter block error =b—-—v o? + ko»? 
Treatments o* + Er 
Intra block error = bk 
Total bk —1 


Now consider a balanced incomplete block design with parameters 
v=6,r=5,b=15,k =2,r\ =1, = 3/5. (5.1) 


The intra-block variance ratio test for the null hypothesis that all treatments 
are the same will have 5 and 10 degrees of freedom; the inter-block variance 
ratio for testing the same null hypothesis will have 5 and 9 degrees of freedom. 

Table IV summarizes the results of the power function calculations using a 
.05 level of significance for p=0, 4, 1, 2, and 10. Also given is the power of 
Fisher’s method (@=1). If the intra-block test had been used alone, the power 
of the test is .62. Note that for this example Fisher’s method gives smaller 
power than the proposed combined test. Also, for the non-zero values of p, 
Fisher’s method results in poorer power than if only the intra-block test had 
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TABLE IV 
SUMMARY OF POWER FUNCTION CALCULATIONS 


Probability of detecting departure from 


null hypothesis 
p = 
Pour P 1- =( 1 )- 2 ( 1 ) 
E 1+kp 3 \1 + 2p 
0 -76 -76 
4 -63 -66 
1 .56 -62 
2 .48 -62 
10 -46 -62 


been used alone. However, with respect to the proposed combined test, the 
power is never less than the power of the intra-block test. Of course as 
p = 0y"/o? becomes large, the inter-block test will have negligible power and the 
proposed combined test will give the inter-block test a small weight, i.e. for 
p=10, 0=2/63. 

In the practical situation, the parameter p is never known exactly but has 
to be estimated. If the degrees of freedom for both estimates for the inter- and 
intra-block errors are large, then the estimate for o?/(o?+-ko,”) will usually be 
suitable to use for the weighting function 

1-£ 

+ 


Even if this is not the case, the use of the estimated quantities in place of the 
parameter values may be justified, because the power of the proposed com- 
bined test is somewhat insensitive to knowledge of the exact value of 6, cf. [15]. 
It is only necessary that the value of the weighting function be in the “neigh- 


borhood” of 


1-E 
E 
6. CONFIDENCE LIMITS ON AND o?/(a?+koy?) 


The customary inter-block analysis (Kempthorne [8], Cochran and Cox [5], 
Rao [10]) involves the calculation of a mean square termed “blocks adjusted 
for treatments” having (b—1) degrees of freedom and an expected value of 


b-1 


It can be shown that this mean square only follows a chi-square type distribu- 
tion for incomplete block designs having a dual design which is a balanced in- 
complete block design. For all other incomplete block designs the distribution 
of the “blocks adjusted for treatments” mean square is unknown. (For example, 
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Kempthorne [8, pp. 463-469] discusses a particular case where this mean square 
arises from a mixture of chi-square distributions.) Hence, if m=1, no estimate 
of (o?+ko,*) is available using the current techniques for the inter-block analy- 
sis. However, from the development in section 2, the quantity S,'/(o?+ko,?) 
follows a chi-square distribution with n,’ degrees of freedom and is independent 
of S,. Therefore, a confidence interval with confidence coefficient (1—2a) for 


is 
1 F 1 /F 
—{—— 1} < — < —[— -1 
6.0 
and for o?/(o?+ko,”) the confidence interval is 
F 


where F, and F;_. are the 100a and 100(1—a) percentage points of the F 
distribution having n,’ and n, degrees of freedom and 


S, /n.’ 
S./n 


The only restriction for calculating these confidence limits is that b—m+1>». 
Otherwise, they are completely general and can easily be calculated for any 
incomplete block design. The only alternative ways for calculating confidence 
limits on o,?/c? or o?/(o0?+koy”) are (a) if m>1 use S,’(m—1) with (m—1) de- 
grees of freedom in place of S,’ in equations (6.1) and (6.2), or (b) use the meth- 
od due to Thompson [11]. It is clear that use of S,’(m—1) will result in wider 
confidence intervals than those given in (6.1) and (6.2) because the estimate for 
(o?+-koy*) will have a smaller number of degrees of freedom. On the other hand 
Thompson’s method requires that the characteristic roots of N’N be known 
where N =||n,|| is the »Xb incidence matrix of the design. This requirement 
limits the application of Thompson’s method to incomplete block designs whose 
dual designs are either balanced incomplete block designs or partially balanced 
incomplete block designs with two associate classes. 


Acknowledgement: The author would like to thank Mary L. Epling for cal- 
culating the power points in Table IV; also thanks go to Oscar Kempthorne 
for many helpful comments. 
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NON-ADDITIVITIES IN A LATIN SQUARE DESIGN! 


M. B. 
Iowa State College and Princeton University* 
AND 


Oscar KemMPTHoRNE® 
Towa State College 


A brief history of the Latin square design is given. A generalization 
of the design to the case in which the rows, columns and treatments 
represented in the experiment are samples from populations of rows, 
columns and treatments respectively is studied. A possible frame of 
reference for the interpretation of the experimental results is described. 
The leads to what is termed a “population model,” various population 
parameters, means and components of variation which are of interest 
to the experimenter. No assumptions are made about additivity of ex- 
perimental units and treatments. Results on expectations of mean 
squares in the analysis of variance are given in terms of quantities 
(denoted by 2’s) which result in a concise description and in terms of 
the components of variation. Biases in the estimation of components 
of variation by means of the analysis of variance are discussed and 
assessed. Comparisons of randomized block designs and Latin square 
designs are given for the general case of non-additive treatments for 
random, mixed, or fixed population of rows and columns. A generaliza- 
tion of the design is discussed. The mathematical machinery which is 
used to derive the results is presented briefly. Finally linear estimates 
and errors of estimates are discussed. One main conclusion of the study 
is that the Latin square analysis of variance may overestimate the error 
of treatment comparisons and underestimate the component of varia- 
tion associated with treatment main effects. 


INTRODUCTION 


HE present paper gives results and discussion concerning the statistics of 

experiments involving the Latin square design, and is part of a series of 
investigations by one or both of the present authors on the meaning and role of 
linear models in the analysis of variance of randomized experiments. 

The line of investigation stems back to the basic writings of Fisher [5, 6] and 
Yates [24, 25]. The procedure employed is related to the work of Neyman 
et al. [12]. A recent step was the “finite model analysis” of Kempthorne 
[10, 11*] in which the basic designs were considered under the assumption of 
additivity of treatment and experimental unit. A step in the direction of study- 
ing non-additivities was made for the case of randomized blocks by Kemp- 
thorne [10] and more general treatment of this case was given by Wilk [19, 20]. 
Research sponsored by the Wright Air Development Center on the “mixed 
model” controversy led the authors to the combination of the methods of 

1 Paper presented at a joint session of the Institute of Mathematical Statistics and the American Statistical 
Association at New York December, 1955. : 

* Paper prepared partly under Analysis of Variance project of the Statistical Laboratory, Iowa State College 
and partly in connection with research sponsored by the Office of Naval Research. 


* Journal paper No. J-2952 of the Iowa Agricultural Experiment Station, Anes, Iowa. Project 890. 
* Originally written and presented in December 1952. 
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“finite model analysis” with genera! sampling of treatments and experimental 
material [23]. This development made use of the design random variables of 
Kempthorne [10, 11] and the sampling dummy variables of Cornfield [2]. The 
relationship of our early work on the completely randomized design to the 1949 
work of Tukey [17] (unpublished and unknown to us at the time), Cornfield 
{3], and Bennett and Franklin [1], and Cornfield and Tukey [4] has been 
discussed elsewhere (Wilk and Kempthorne [22], Cornfield and Tukey [4]). 
Other relevant investigations are those of Scheffé [15] and Smith [16]. Work 
indirectly related to the present context is that of Fisher [7], Welch [18] and 
Pitman [14] on randomization tests. 

The Latin square was first proposed as an experimental design by Fisher [5]. 
The early history is best given by the following quotation from Fisher [6]: 

“Systematic arrangements in a square, in which the number of rows and of 
columns is equal to the number of varieties such as 


ABCDE ABCODE 
EABCOD DEABC 
DEABC BCDEA 
CDEAB EABCOD 


BCDEA CDEAB 


have been used previously for variety trials in, for example, Ireland and Den- 
mark; but the term ‘Latin square’ should not be applied to any such system- 
atic arrangements. The problem of the Latin square, from which the name 
was borrowed, as formulated by Euler, consists in the enumeration of every 
possible arrangement, subject to the conditions that each row and column shall 
contain one plot of each variety. Consequently the term Latin square should 
only be applied to a process of randomization by which one is selected at 
random out of the total number of Latin squares possible: or, at least, to 
specify the agricultural requirement more strictly, out of a number of Latin 
squares in the aggregate, of which every pair of plots not in the same row or 
column belongs equally frequently to the same treatment.” 

The actual randomization procedure for some sizes of the square was given 
by Yates [24]. The enumeration of 6X6 Latin squares was done by Fisher and 
Yates [8]. Yates [25] gave a detailed discussion of Latin squares in which a 
row, column or treatment is missing and showed that “incomplete Latin 
squares of these types give unbiassed estimates of error and are therefore valid 
experimental arrangements.” The validity of the Latin square was discussed 
by Neyman et al. [12], using a mode of reasoning related to that of the present 
paper. The 7 X7 Latin squares were enumerated by Norton [13]. (The problem 
of orthogonal squares is not relevant in our present context.) The distribution 
of a criterion equivalent to the variance ratio (treatment sum of squares di- 
vided by treatment plus error sum of squares) was examined for the Latin 
square by Pitman [14] and Welch [18]. An evaluation of the basic Latin square 
design on the basis of a derived linear model with the assumption of additivity 
of units and treatments and no additional sampling was given by Kempthorne 
[10, Section 10.5]. 
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Following Fisher [6] we shall take the term Latin square design to imply 
both a geometrical pattern and a suitably restricted randomization procedure. 
For our immediate purposes, which involve first and second moments only, 
we take Fisher’s rule in the last part of the last sentence of the quotation above 
as defining the randomization restrictions. 

The structure of the paper is as follows. First an experimental situation and 
design, which is in fact a generalization of the basic Latin square design, is 
described formally. A frame of reference for analysis is then formulated and 
some general results on expectations of mean squares given. Some comparisons 
of the Latin square design and the randomized block design are given and 
extension of the design discussed. The machinery by which the results were 
obtained is described briefly. The paper closes with a short discussion of esti- 
mates both of components of the population model and of errors of estimates. 

One main conclusion of the study is that the Latin square analysis of vari- 
ance may seriously overestimate the error of treatment comparisons and under- 
estimate the component of variation associated with treatment main effects. 
In spite of this, the Latin square will, in many circumstances, be more ad- 
vantageous than the randomized block design. 

We would like to thank John Tukey for helpful comments in connection 
with the preparation of this paper. 


EXPERIMENTAL UNITS AND TREATMENTS 


Our analysis is centered around two basic notions, namely treatment and 
experimental unit. A treatment is a combination of stimuli or operations im- 
posed by the experimenter, who is usually able to identify the most relevant or 
essential features. While it is usually reasonable to regard a treatment as per- 
fectly reproducible in principle, treatments are ordinarily not perfectly re- 
producible in practice. For the analysis given here it will suffice to assume that 
to each treatment there corresponds a population of combinations of stimuli or 
operations from which one combination appears at random in each use of the 
treatment. This assumption is likely to be a satisfactory approximation to 
many real situations, and it does cover to a certain extent (i) variabilities arising 
from inevitable limitations in technique in, say, attempting to apply a pressure 
of exactly 10 p.s.i. and (ii) such treatments as variety of corn where we employ 
individual, somewhat variable, seeds and not an “abstract variety.” 

In most cases, an experimental unit may be thought of as quite apart from 
the experiment, but we use the term to denote the complex of experimental 
material and circumstances associated with possible treatment applications in 
the experiment. The relevant background for the experiment, so far as statisti- 
cal inferences are concerned, is tied up with the appropriate population of 
experimental units. The latter is determined by such random sampling and 
allocation as is built into the experimental procedure, implicitly or explicitly. 
Just how identifiable experimental units are will vary from experiment to ex- 
periment. The association of units with individual animals of a litter may be 
practically sufficient in comparisons of rations. In the case of comparative 
psychological “tests” on the same person at different times, the experimental 
unit is a particular combination of person and test environment. 
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The experimenter is usually confident or at least hopeful that only certain 
features of the whole complex of conditions and stimuli involved in the applica- 
tion of a treatment are essential. For example, in noting the response of a 
plastic to a pressure of 50 p.s.i. we might ignore such details as the material of 
the pressure plates and whether the mechanism is hydraulic or not. On the 
other hand, if the rate of pressure increase is greater for a final pressure of 100 
p.s.i. than that for 50 p.s.i., then the description of a treatment by the final 
pressure would probably be quite inadequate. Just how incomplete the de- 
scription of a treatment may be is not in general a statistical question. The 
similar ambiguities related to experimental units are usually of less importance 
if constancy of their properties, rather than complete description, can be as- 
sured and if an adequate randomization procedure is followed. 

We shall take the point of view throughout that an experimental unit is not 
reproducible (can be used only once). Usually experimental units are “given” 
in the sense that the experimenter cannot or does not feel it worthwhile to con- 
trol their variation by means other than randomization. 

The distinction between treatment and experimental unit is the essential of 
the difference between a Latin square experimental design, involving as a 
basis ¢? units and ¢ treatments, and the fractional factorial 1-in-t of a tXtXt 
experimental situation where ¢? treatments are selected according to a Latin 
square type pattern, at random, from the possible ¢* factor combinations. The 
latter procedure may be called Latin square sampling of factor combinations. 
The actual experiment with the selected treatments will involve experimental 
units as well, and some experimental design specifying the randomization of 
treatments to units. 


THE EXPERIMENTAL SITUATION AND PROCEDURE 


We suppose that there are R.C experimental units classified into R rows and 
C columns with one unit at each row-column intersection, for example plots 
of land or portions of a roll of paper, classified by distance along and distance 
from an edge, or units of person-hours classified by person and by hour. We 
suppose also that there is a population of 7 treatments, which are reproducible 
at will. The notation and experimental procedure are then as follows: 
(1) denote row in unit population by i=1, 2,---, R, 
(2) denote column in unit population by j=1, 2,---,C, 
(3) denote treatment in treatment population by k=1, 2,---, T, 
(4) select at random ¢ rows and let 7*=1, 2, - - - , ¢ denote the selected rows 
in the (random) order of selection (¢<R), 
(5) select at random ¢ columns and let j7*=1, 2, - - - , ¢ denote the selected 
columns in the (random) order of selection (t<C), 
(6) select at random ¢ treatments and let k* =1, 2, - - - , t denote the selected 
treatments in the (random) order of selection (<7), 
(7) select a Latin square at random from the totality’ of tX¢ Latin squares 
and use it to assign selected treatments to selected experimental units, 
(8) let f=1, 2, - - - , t denote replicates of a given treatment. 


5 For our purposes (and from practical necessity for the larger squares) this can be related to “any grouping 
of t Xt squares which fulfil] the requirement of Fisher's (6) rule.” Any transformation set has this property. 
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We may now characterize the resultant ¢? observations in two ways both of 
which are useful 

(1) z#¢ is the observation on the unit corresponding to selected row 1* and 

selected column j* 

(2) 2,7 is the observation on the fth replicate of selected treatment k*. 

We use the 2; when we work with row and column totals, and the 2+, when 
we work with the treatment totals. It may be recalled that some writers use a 
notation equivalent to z,*;«*) indicating that once 1* and j* and the plan are 
given k* is fixed. 

The examination we shall make includes the case when the totality of rows 
and columns are used which corresponds to the usual Latin square design. 
The possibility of a broader experiment involving a sampling of rows and 
columns does not appear to have been discussed in the literature, but a more 
inclusive examination seems to be of practical as well as theoretical interest. 

In the case of an experiment involving t=6, T=100, R = 200, C=6 we may 
be concerned with one or more of the following inference problems: (i) obtain- 
ing an estimate of variability of members of the population of 100 treatments 
relative to the entire population of 1,200 units; (ii) a similar estimate as in 
(i) but relative only to the actual 6 rows and 6 columns which were employed 
in the experiment; (iii) estimates concerning differences among a subset of 
treatments actually employed relative to any subset of the relevant experi- 
mental unit population; etc. In general the possible scope of the inference will 
be dictated by the experimental design and procedure. Any analysis suggested 
by the data itself is subject to the usual criticism but may be very useful and 
important nonetheless. Extension of inferences beyond the statistical popula- 
tions, as determined by the random sampling and allocation procedures, can- 
not have a statistical justification or assessment. Such extensions are of the 
utmost scientific importance but must lean on many factors, only one of which 
is the statistical analysis. 


FRAME OF REFERENCE FOR ANALYSIS; POPULATION MODEL 


A prerequisite for formal analysis of the experimental data is « relevant 
conceptual “frame of reference” which we now develop, based on some explicit 
elementary or primitive assumptions. We utilize the notion of a true or typical 
yield for each particular combination of treatment and experimental unit. It is 
supposed that if one could use the same unit over and over again and one could 
apply the same treatment each time one would get a population of yields which 
would have a “representative” (often chosen to be the mean) value Yj (which 
we shall refer to as the “true response”) dependent only on the particular unit 
and treatment considered and not on the whole plan of treatments as assigned 
to units. This assumes that there is no competition or collaboration between 
units. It is easy to think of agronomic experiments for which this is not the 
case, for example an effective insect repellent on a particular plot of land 
causes the insects to move on to a plot of land which is being treated with an 
ineffective repellent. 

The conceptual array of RCT numbers { Y;s} represents the totality of 
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relevant empirical information that is all that experimentation alone can give 
(as opposed to summary cause and effect, or mechanistic, notions). This is 
frequently a small part of the knowledge the experimenter wants, which is 
usually laws or patterns of behavior, or generalization to a wider background 
than the experiment provided. 

The scale of observation will be assumed to be given. The procedure to be 
followed is not dependent on the scale of observation. Some scales are however 
better than others, as indeed is obvious without statistical considerations. One 
important aspect of our conclusions is that the choice of scale is an important 
problem from the point of view of statistical inference alone. 

The statistical problem is what our observations tell us about certain pa- 
rameters of (functions of the elements of) the population of RCT true responses. 
A useful decomposition of the “true” responses, which we will call. the popula- 
tion model, is 


Yin te; + + + + (ret)in (1) 
where, with replacement of a suffix by a dot denoting averaging over the suffix, 
= — we, = th 
= Yu. — — + Y..., (Dae = (Yiu — — Vian + 
(ct)ix = Y — Y.4+ Y..., and 
(ret) sie = Vise — — Yew — Vin + + + 


This is an algebraic identity involving main effects and interactions. Each of 
the terms of (1) has a definite physical interpretation. For example, & is the 
mean difference which would be observed, averaged over the totality of experi- 
mental units, between treatment k and the mean of all treatments in the treat- 
ment population. As in all cases where main effects and interactions are used, 
it is important to note that a term depending on certain of the factors of treat- 
ment or classification depends on the totality of the other factors also; for 
example, (rc),; is based on an average over the population of treatments con- 
sidered. The number of parameters in (1) is (1+R)(1+C)(1+T7), the excess 
over RCT being due to the relationships: 

= 0; Les = 0; =0; (re)s = 0; (ha = 0; 
or 


iorj 


(ta =0; L (retin = 0. 
jork ijork 
The question may be raised as to whether concern with interactions is justi- 
fied. The answer is emphatically “yes” since we can make them large by an 
appropriate transformation of the scale of observation (and by the same token 
may be able to make them small). If there are no interactions, (1) reduces to 


Ya (2) 


which states, for instance that treatments k and k’ always differ by exactly 
t,—t, regardless of the row or column in which the difference is evaluated. In 
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this case, a basis (usually non-statistical) for inferring from the experimental 
population to another (usually broader) population of real interest is that addi- 
tivity holds also over the “target” population. This can, of course, only be an 
assumption. 

To take account (in our formal specification of a conceptual frame of refer- 
ence or population) of additional variabilities such as errors of measurement of 
responses, variabilities in treatment application and variation in the state of 
units (independent of treatment randomization), we make the assumption that 
the observable response if treatment k were applied to unit (7) is a random 
variable Yisky 


= Vise + (3) 


where the ¢,,, are assumed to be uncorrelated random variables with means 0 
and constant variance o?. The e;;, contain all technical error (error of technique, 
measurement, and sampling if it is used). We have thus introduced an array of 
RCT populations, represented by the RCT random variables { yj, }. (The dis- 
tributional assumptions of the «’s can be weakened, at the cost of somewhat 
more awkward formulas, for instance we might assume that = 
(known) or V (ei) =0;;*. Because of the randomization of treatments to ex- 
perimental units, the latter assumption, that the variance of technical errors 
depends on the experimental unit, will not complicate our statistical assess- 
ment of treatment differences, at least with regard to their estimated variances.) 


DEFINITIONS 
We have now described the RCT ccnceptual pepulations which we take as 
the frame of reference for analysis of the experimental data and proceed to the 
definition of additional parameters which we shall use. As measures of dis- 
persion for the various sets of defined effects and interactions we use com- 
ponents of variation as follows: 


1 1 1 


R 
1 1 
(R (re) «37; on? = (R I(T 1) (rt) (4) 


(ct) ores? 


Ore? 


In general each of these is a sum of squares divided by the number of quantities 
less the number of linear dependencies. These components of variation are 
related to Gini’s mean squared difference and its generalizations. We have 
avoided the term components of variance because of possible confusion with 
the conventional definition of variance of a random variable. 


RESULTS ON EXPECTATIONS OF MEAN SQUARES 


We use the term “expectation” in the standard statistical sense, meaning 
average (weighted by probabilities if unequal) over the population of repeti- 
tions, and specifically this includes the population of randomizations. 
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We shall first give the expectations of mean squares of the usual analysis of 
variance in a condensed form which displays the inherent mathematical struc- 
ture (though displaying not at all the structure of greatest experimental in- 
terest). In Table 1 they are given in terms of subscripted quantities 2 (read as 
cap sigma), the definitions of which are given in the lower part of the table. 


TABLE 1 
EXPECTATIONS OF MEAN SQUARES (condensed form) 
Mean Square Expectation 
Rows R* Zo+tz, 
Columns ce Zot+tz. 
Treatments 
Discrepance D* Zo 
Definitions of the 2’s 
1 1 
= = o,? ort? + CT 
T RT 


1 1 1 
= — R On? — + Re 


1 

1 

= ore? — 


1 
Le = oes? — R 


Zret = 


Zo = + Leet + + Lee + 


The >’s have an obvious and simple structure in terms of the components of 
variation. They are not necessarily positive and cannot therefore be measures 


of dispersion of any population. 
The equations which give the components of variation in terms of the = 


quantities are also simple: for example 


1 1 1 
2 = —z, —z, — Lre 


and 


1 
ort? = 2+ Zret: 
Writing the expectations of mean squares in terms of the 2 quantities makes 
it immediately obvious just what can be estimated unbiassedly by linear com- 
binations of mean squares, namely linear combinations of 2X, 2,, Z. and 2. 
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The “natural” or common way of estimating o,’ is by 1/t(7*—D*) which in 
fact estimates 2, and as an estimate of o,’ is subject to a bias of 


1 1 1 


Cc RC 


which will usually be negative. It is noteworthy that the magnitude of the bias 
depends on the absolute magnitudes of R and C and not their magnitudes 
relative to t. 

An alternative form for the expectations of mean squares is also useful. 
Table 2 gives them in terms of the components of variation, which are the 
quantities of experimental interest. 

The results for the special case t=R=C=T, which corresponds to the 
“usual” Latin square design, are given in Table 3, for comparison with other 
work ‘e.g. Neyman et al. [12]). 


TABLE 2 
EXPECTATIONS OF MEAN SQUARES 
Mean Square Expectation 
R 
D* + + op? + 
1 1 1 
TABLE 3 


A SPECIAL CASE (=R=C =T) 


Mean Square 


R* +} (1 =) Fret? + + to,? 
c* + (1 =) + or? + to,? 
of + (1 =) Cret® + + toy? 


| 
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DISCUSSION OF EXPECTATIONS 


(1) When we consider the Latin square as a design for comparing treat- 
ments, we see that if in fact the treatments are identical then a,;’, 
rs’, Oe” and o,* will be zero and the expectations of treatment mean 
square and discrepance mean square are equal, regardless of ¢,.?, which 
reflects the interactions of rows and columns. This was first stated by 
Fisher in 1926 and is the basis for use of the design to evaluate treat- 
ments. 

If the treatments and experimental units are additive then gye:?, +4" 
and g,,’ will be zero and the quantity 1/t(7*—D*) estimates o,* un- 
biassedly. (The case of additivity for t=R=C=T was treated by 
Kempthorne [10; Section 10.5].) 

Neyman et al. [12] in studying the case =R=C=T came to the con- 
clusion that the Latin square was always positively biassed, in the sense 
that, if then is always greater than H(D*). Our results 
indicate that the reverse will ordinarily be true. The origin of the dis- 
crepancy lies in the fact that they did not include the row-treatment 
and column-treatment interactions in their manipulations, and this led 
to the incorrect conclusion that 1/t#(T* —D*) equals o,2+-(1/RT) 
This is not the form of the result of Neyman et al. because they made 
additional homogeneity assumptions. 

(4) As an error term for treatments D* is usually too big on the average, 
but if R and C are large the overestimation of error is unimportant. 

(5) There appears to be no reasonable case for considering the usual agro- 

nomic Latin square experiment to have an origin such as the sampling 
of rows and columns here considered, even if the actual field in which 
the experiment is performed is a very small part of the population of 
interest. It must be remembered that in the situation examined here 
the whoie population of interest is stratified by rows and columns before 
the sample of experimental material for the experiment is drawn. 
It would appear to be rare that, with sufficient heterogeneity of experi- 
mentai material to warrant a two-way classification, treatments would 
act additively with respect to both rows and columns. It may be that 
the three-way interactions of rows, columns and treatments is often 
small but the usual way a Latin square experiment is done, i.e. taking 
rows and columns in such a way as to account for appreciable variability 
of the units, is such as to make row and column interactions with treat- 
ments very likely, because the widespread dependence of response to 
treatment on yield with any particular treatment would result in such 
interactions. 

(7) The Latin square design does in general reduce the actual error associ- 
sociated with the evaluation of treatments (as may be seen from Table 2 
by using the fact that the average variance of treatment differences is 
proportional to the treatment mean square after deleting the o,? term) 
but at the same time may seriously over-estimate the error if R or C 
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are not large. Most agronomic uses of the Latin square design fall in this 

category. The implications of this in specific situations should be con- 

sidered. The design will certainly be good where anticipated treatment 
differences are large and their accurate “point” estimation is the main 
concern. 

In most statistical experimental situations it is desirable to consider 

tests of significance and tests of hypotheses about treatment effects. 

We accept the view that tests of significance are evaluatory procedures 

leading to assessments of strength of evidence against particular hy- 

potheses, while tests of hypotheses are decision devices. We are here 
concerned with the former, and in this connection it should be noted 
that 

(a) the expectations of mean squares are in some degree irrelevant to 
the exact (permutation) test of significance of the null hypothesis 
that the treatments are identical. The procedure by which this null 
hypothesis would be evaluated, in strict theory at least, would be 
to superimpose on the observations obtained with a particular plan 
all possible randomizations of plans (but not of selection of rows 
or columns) and determine the frequency with which the value of a 
chosen criterion for the plan actually observed is equalled or ex- 
ceeded in the totality of possible plans. The effect of non-additivi- 
ties under such circumstances is essentially unknown. (Some more 
detailed discussion of this is given by Kempthorne [10, p. 149 and 
Section 12.6].) 

(b) if the observed treatment mean square is less than the observed 
error mean square and the variance ratio criterion is to be used in a 
randomization test it is difficult to see how a high level of signifi- 
cance can possibly be reached since, with given observations, the 
average content of treatment and discrepance mean squares are 
equal over the set of all randomizations. If one thinks of the sensi- 
tivity of an experiment as the average of squared treatment differ- 
ences divided by the average error in estimation of these, then it 
appears that if R and C are not large and row-treatment and 
column-treatment interactions are important then the overestima- 
tion of error may occasion a serious underassessement of the sensi- 
tivity (or value) of the experiment. The desirable direction to move 
in is toward a scale of analysis where interactions are less important. 
(While the overestimation of error is less serious for the randomized 
block design the increase in actual error will often more than out- 
weigh this. This matter is discussed further below.) 

(9) What we have said above has the consequence that if experimenters 
would follow a rule of discarding Latin square experiments in which the 
treatment mean square ig not significantly greater than the error mean 
square, say, they might well be discarding very good estimates of treat- 
ment main effect comparisons. 

(10) In view of the fact that it is desirable for an experimenter not only to 
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have “good” estimates but also to know how “good” the estimates are, 
the strong recommendation is made to attempt to transform to a scale 
where additivity more nearly obtains. How to proceed is an outstanding 
problem, and, in our opinion, one of the most important in the analysis 
of experiments. Offhand, it seems that the practice of transforming to 
get a high value of F, the variance ratio, a procedure which has fallen 
into disrepute (perhaps because experimenters were using it for the 
wrong reasons) has some commendable aspects. 

In circumstances where row-treatment and column-treatment inter- 
actions are important and R and C not large it may be that 1/t7* is a 
better estimate of o;* than 1/t(7*—D*). Also if for example RF is large 
and C equals ¢t the expression 1/2¢(27*—D*) may be a better estimate 
of ¢;? than the usual one. In evaluating such alternate estimations for 
o,’ the possible difficulties or advantages associated with under-or-over- 
estimation must be most carefully considered. 


COMPARISON WITH RANDOMIZED BLOCKS 


We now consider the relative merits of the randomized block design (Wilk 
[21, 23 II and VI]) and the Latin square design. We do not consider the 
randomized block and Latin square analyses of variance for a particular set of 
observations, but rather we consider the possible results over all randomiza- 
tions and technical errors for each design. The randomized block design in- 
cludes very many more plans than the Latin square design. For instance, for an 
experiment involving four replications for each of four treatments, there are 
576 different 44 Latin square plans and (4!)* or 576? different randomized 
block plans. The question is whether one should use a random one of the 576 
different Latin square plans or a random one for the 576? different randomized 
block plans each with the analysis appropriate to the design. A partial answer is 
given by considering the expectations of mean squares in the two cases. These 
are given in Table 4 for the case of rows as blocks in the randomized block 
design. The randomized block error mean square is denoted by I*. 

The expectations of Table 4 are in terms of the 2’s as defined for the Latin 
square. (They would have a rather different but very simple structure in terms 


TABLE 4 
EXPECTATIONS OF MEAN SQUARES WITH RANDOMIZED BLOCKS 


Mean Square Expectation 


C-t 1 


— 1) 


B* Zo + 
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of 2’s appropriate to a randomized block design.) We see that 1/t(7*—I*) is 
an unbiassed estimate of 2;+1/C2.: which is equal to o,7—1/Ro,;*. The bias in 
the randomized block analysis is —1/Re,,’. If C is small and column-treatment 
interactions are important then the bias in estimating o,* from a randomized 
block experiment may be appreciably less than if a Latin square had been used. 
On the other hand, if column main effects are important, the variance of com- 
parisons of treatment effects will be much increased, as can be seen from the 
excess of the expectation of the treatment mean square with randomized blocks 
over that with the Latin square, namely 


T 
1 ( 1 t-—1 
R\T Cc 
If R is large compared to r, « situation commonly referred to as “rows are 
random,” the comparison is given in Table 5. 


TABLE 5 


COMPARISON OF RANDOMIZED BLOCKS AND LATIN SQUARE 
(ROWS RANDOM) 


Expectation of mean squares 


Latin Square Randomized Blocks 


1 1 1 
Treatments Zo + 0.2 — + z) + 


1 
Error Zo Zo + — (= + 


From Table 5 we may guess that even though in this case randomized blocks 
allows unbiassed estimation of ¢,*, the Latin square will have in general beth a 
lower real and a lower apparent error. This serves to underline the fact that the 
Latin square may well give more accurate estimates of treatment comparison3 
even though their errors cannot be estimated unbiassedly. 

As a hypothetical example let C= T =t, ores? =0, Ore? = oer? = 074? = 00", =0, 
a? =20,". Then if we define the non-centrality factor (NCF.) to be 


E (treatment mean square) — FE (error mean square) 


2E (error mean’square) 


C T R RT 
t-—1 t-1 
2 
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we have 
Q(t 1) of 
6(5t — 2) ao? 
which is positive whenever o,?/o,’ is larger than 

2 1 
t—1 2u¢ — 1) 
With ¢ equal to 4 this is 3, so that if o,?>30o?, NCF(LS)>NCF(RB). 

In the case when columns are “random,” i.e. C is large compared to t, the 
situation is as given in Table 6. In this case we note that the bias in estimation 
of o,? is the same for both cases. The Latin square will however usually be 


better for estimation of treatment differences and will usually have a larger 
non-centrality factor. 


NCF(L-S) — NCF(RB) = 


2t 


TABLE 6 


COMPARISON OF RANDOMIZED BLOCKS AND LATIN SQUARE 
(COLUMNS RANDOM) 


Expectation of mean squares 


Latin Square Randomized Blocks 


1 1 
Error Zo Zo + («2 T Co? + — Tret 


The general conclusion from the results given above is that in general the 
Latin square design is likely to be better than the randomized block design 
though it is obvious that, if the column classification of experimental units 
should be useless in the sense of not accounting for excess variability, they will 
be essentially equivalent apart from the loss of degrees of freedom for error 
resulting from the use of column classification in the Latin square. 


A GENERALIZATION 


There are a number of sampling situations, in addition to the one considered 
above, within which a Latin square design might be embedded. For example we 
may have S sources of RC units classified as above; select and use ¢* units from 
each of s of them, and examine ¢ selected treatments in s Latin squares, each 
tXt. Our previous situation is a special case with S=s=1. The results for ex- 
pectations of mean squares for this case are given in Table 7. 

The appropriate population model for this situation has the form 


= +195 + + te + + (rh) + + (Tet) gin + 
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TABLE 7 
GENERALIZATION OF LATIN SQUARE DESIGN SITUATION 


Mean Square df Expectation 


S* (s 1) Zo + tz, + tz. + + 
R* s(t — 1) Zo + tz, 

ce a(t — 1) Zo + tz, 

7 (t — 1) Zo + + 

I*sr (s — — 1) Zo + 

s(t — 1)(¢ — 2) Zo 


Definitions of 2’s 


1 1 1 1 
t+ top RO 


1 
Let = Sret® 


= 


Lo = + + + Zee + Zee, 


where g=1, 2,---,S;i=1,2---, R (for each g);j=1, 2,---, R (for each 
g); k=1,2,---,T. 
The definition of the components of variation is then as exemplified by 
o? = —— 
1 


ore? = S(R 1) (re) ois"; ete 


The inverse relationship between the o*’s and the 2’s is as indicated by 


t at R rt C et RC iret 


cee fig ten ¢ 
T st R r RT rt CT ct ret 
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1 1 1 
1 1 1 
=.= - R Ore? + Ror 
1 
Zre = Ore? — 
1 
on? — Gres? 
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From Table 6 it can be seen that if S is large, then 1/st(7* —J,,*) is essentially 
unbiassed for ¢,?, and J,,* would be an appropriate error term for evaluation of 
observed treatment comparisons, and the assessment or estimation of ¢,?. 


MACHINERY: THE STATISTICAL MODEL 


We have found it of considerable value in many respects to formalize and 
summarize the implications of the randomization procedures and the various 
assumptions underlying the population model in an explicit statistical model 
for the observations. This will be described rather briefly for the main situation 
described above. 

We define various sets of “dummy” random variables which reflect and de- 
fine the essential features in the sampling and randomization procedure. They 
are as follows: 

(1) a= 1 if the i*th selected row is the ith row in the population 

=0 otherwise 

(2) B= 1 if the j*th selected column is the jth column in the population 

=0 otherwise 

(3) ve*"= 1 if the k*th selected treatment is the kth treatment in the 

population 
=0 otherwise 
(4) 5¢*%=1 if the fth replicate of the k*th selected treatment occurs on 
selected i* and selected column j* 
=0 otherwise 
and finally 
pe" = >>, 5e;°* which is unity if the k*th selected treatment occurs on 
the 7*th selected row and the 7*th selected column. 

The a’s, 6’s, y’s and 4’s are groupwise independent and all of their distribu- 
tional properties can in principle be written down or worked out. In the case 
of the 6’s (and the derived p’s) this may be difficult or, at present, impossible 
because it would require enumeration of all ¢<¢ Latin squares. It may be re- 
called that Welch [18] had to use a complex enumeration scheme to obtain 
some of the less elementary properties for 5X5 and 6 X6 squares. In the present 
case, where we are only concerned with first and second moments the necessary 
properties of the 6’s or p’s are readily obtained. (They are given with different 
notation by Kempthorne [10, Section 10.5].) 

Using these random variables, explicit models for the observations are 


= 
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An explicit expression for 24°; can be written down from the relations above. 
The case when the whole of a population is represented in the sample, is in- 
cluded formally in a simple way: for example, if all rows are included we let 
a;* =1 for é* equal to i and zero otherwise. 

The models given above appear to have a rather complex and formidable 
structure. One might ask if they are really necessary. Our own develooment 
of such models was part of a general investigation [21, 23] into the meaning of 
the linear models which are commonly used to present the analyses of experi- 
mental designs. Such apparently complex models arose quite naturally and we 
have found them to be easy to handle. They can be used to study many sta- 
tistical properties of interest, such as expectations of linear estimates, variance 
of estimates, expectations of mean squares, and estimates of error, by formal 
algebraic expectation procedures for any experimental design. There is no 
reason in principle why the models could not be used to find variances and 
covariances of mean squares. It may be expected however that more purely 
combinatorial arguments will be developed to handle these problems [Hooke, 9]. 

Quite apart from the fact that the over-all method has been useful in studying 
the standard designs, its most important attributes are: 

(1) it lays bare the dependence of the meanings of effects and interactions 
on the populations involved, e.g. of treatment effects, on the populations 
of rows, columns, ete. in the general experimental situation under study. 

(2) it exhibits explicitly the dependence of the meanings of effects and inter- 
actions on the scale of observation. 

(3) it brings into the open the necessary assumptions or considerations for 
meaningful interpretation of the analysis of variance. (Here “meaning- 
ful” is in contrast with writing down an aribtrary, albeit customary, 
linear model.) 

(4) it helps to indicate which assumptions are very important and which are 
unimportant. 

(5) it helps to bring to the fore one of the basic problems of quantitative 
experimentation, namely that of finding the functional structure rela- 
tionship of the variables underlying an experimental situation. 


LINEAR ESTIMATES AND ERRORS 


We conclude with a short section on estimation of effects. For simplicity we 
consider the case where treatments are fixed, i.e. 7’ =t, so that we can write an 
observation as 2. Then as in all cases of randomized designs that we know of, 
unbiassed estimates of linear population parameters are given by the same 
linear functions of appropriate sample means. In the present case, regardless 
of interactions and relative values of R, C, and ¢ an unbiassed estimate of 
is given by Difficulties arise in the estimation of errors 
of estimates. We may note the following cases: 

(1) with interactions of treatments with rows and columns negligible, 


var — 2!) = + 


| 
| 
j 
| 
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which is estimated unbiassedly using D*. This is of course well known 
and due to Fisher [6]. 

(2) as a guide under general conditions, the average variance of estimated 
treatment differences is, of course, directly connected with the expecta- 
tion of the treatment mean square and is 


t re RC ret R rt C cet t 


and this can be seriously overestimated by 2/tD* if non-additivities are 
important or R and C are not large. For randomized blocks with rows as 
blocks the average variance is 


and it is easy to visualize situations in which this average variance will 
be considerably larger than with the Latin square design. On the other 
hand the estimate of variance from the randomized block design may 
be better and less biassed. Where either design is reasonable, the Latin 
square is usually to be recommended but with some caution to the effect 
that its error term will usually be too large. 
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TRANSFORMATIONS TO NORMALITY USING FRACTIONAL 
POWERS OF THE VARIABLE 


P. G. Moore 
University College, London 


1, INTRODUCTION 


N MANY forms of analysis of nonnormal variables we seek some transforma- 

tion of the variate involved in order to make it approximately a normal 
variable. This enables us to utilize the normal distribution, which has been 
exhaustively investigated and tabulated, to obtain percentage points, ordinates, 
and so on for our original variable. If this is not done we may find it necessary 
to perform much laborious work in the tabulation of the variate itself, and 
functions of it, perhaps with only limited uses in view for the completed work. 
A second reason for a transformation is that sometimes the tables of the exact 
distribution are not available and it behooves us to find a quick and simple 
test. 

As an example of the latter consider the standard deviation, s=2(x—Z)?/n, 
in a sample of n drawn from a normal population with standard deviation oc. 
We know that ns?/o? is exactly distributed as a x? distribution but as an 
approximation we can assume s to be normally distributed with mean 
o[1—(1/4n) | and variance o?/2n thus enabling us, at any rate for n larger than 
about 15, to utilize the normal distribution for a test of significance. 

Of course, in many cases, where observations fall into a number of groups, 
it is more important to equalize the variance of these groups than it is to bring 
the variate into normality. Thus in the analysis of variance, although both 
items are desirable, it is of more importance, except possibly for a one-way 
classification, to stabilize the variance otherwise it would never be known if 
one effect was really significant or not. 

This present paper examines the properties of a certain class of transforma- 
tions under the assumption that they are designed to transform the variate, X, 
into some form of normal distribution. The class of transformations considered 
is that of 


Xt where 0<r<l (1.1) 


and X is a non-negative variate. As further developments we consider the 
transformation to a Type III variate and the use of (X +a)’ where a is a random 
variable in the interval [0, 1]. 


2. PROCEDURE ADOPTED 
The method used to investigate this problem will be analogous to that put 
forward by Haldane [4] when he was interested in the distribution of the 
volume of objects given that the three basic measurements for the volume were 
each normally distributed. Thus instead of considering a general variate X and 
seeing the effect of taking the rth power of it we will consider a normal variate 
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and examine the distribution of 1/rth power of that variate. We can find the 
B, and B, of the new variate and since they must both be dimensionless and 
also functions of the first two moments only of the normal curve they must be 
functions of the coefficient of variation. 
Let 
X = + (2.1) 


where z is a normal variable with mean zero and standard deviation unity, thus 
making X a normal variable with mean m and coefficient of variation (defined 
as the standard deviation divided by the mean) equal to k/*. We assume that 
m is large and positive and k is small so that 


P{X <0} <e 


where ¢ is extremely small. We may now consider the moments of X for which 
we require to evaluate 


E(X‘) = m'‘E(1 + (2.2) 


where ¢ is greater than unity and has both integer and fractional values. In the 
case of the transformations r=}, 4, } and so on the values of ¢ needed will 
always be integral and thus exact values of the crude moments of X*, X*, X4, - - - 
may be obtained and from them the values of the moments about the mean. 
Finally the values of the moment ratios can be obtained. When the value of r 
does not take one of these simple forms, for example if the numerator of the 
fraction is not unity, the value of ¢ in (2.2) will frequently be fractional and 
hence the moments of the transformed variate can only be found as a power 
series in k. If k is small it will be adequate to omit the higher powers but this 
limitation must be borne in mind. 


8. RESULTS OF TRANSFORMATION 


The results of the procedures of the preceding paragraph carried out for 
various values of r are given in Table 1 as functions of k. For the first two 
cases the results are only approximate whilst for the last three they are exact. 

TABLE 1 
MOMENTAL RATIOS OF zc IF z IS NORMAL 


Bi Be 


k(1 +0 .7963k +0 .336763k*) 3(1 +0 .148148k +0 .021602k? +0 .003086k*) 
2.25k(1+0.291667k +0 .06423k?) | 3(1+0.64+0.16k*) 

8k(3+k)*(2 3(4+20k +5k*)(2+k)* 

108k (3 + 16k + 15k*)2(3 + 12k 3 +-72k(7 +48k +75k* + 15k*) (3 + 12k +5k*)—* 
162k (2 +32k + 149k* + 198k? +33k*)*| 3+ (4124k +94201k* +70735k* 


X(2+21k +48k? + 12k*) +204250 1k‘ + 1880525k + 2350084*) 
X8(2+21k +48k* + 12k*)* 


| 

4 
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Numerical values for the five cases considered have been calculated and are 
plotted in Fig. 1 on a f;, 6, diagram. In this figure the lines are drawn as straight 
lines which, for the small values of k involved, is almost exactly true. However 
if the figure were extended so as to include larger values of 6, and 62 this would 
no longer be the case. It is possible to join up the lines and draw in contours for 


+o 


Fia. 1 


k superimposed on the contours for r which are already on the diagram. These 
contours for k are, however, very close to those for r and only a few have been 
drawn in the figure. We can see from the diagram that a change of r may quite 
radically alter the value of k even though the (:, 82) points of the transformed 
variates may be very close indeed. Of course in the practical applications of 
such transformations we would normally make the first two moments of the 
transformed variable equal to those of the normal curve that we used to ap- 
proximate to it thus ensuring that the correct value of k was being utilized. 
Thus it is also useful to plot the values of c, the coefficient of variation of the 
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resulting transformed variate, x". This has been done in Fig. 2 and with the 
aid of this figure one can see now whether a variable has both the correct 
momentai ratios and the correct coefficient of variation to be transformed into 
an approximate normal distribution. 


32 


Coetticient of variation, c,ot x 
Fie. 2 


4. TRANSFORMATIONS USING x? 


There seems no reason why transformations should always, of necessity, be 
made to the normal distribution. Other distributions have also been well tabu- 
lated and it is possible to utilize, say the x? or Type III distribution for the 
purpose. Cadwell has approximated to some distributions [1] by taking (x?)” 
and finding » by equating the moments. Here we have taken y as a Type III 
variate where 
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This has 
m'(y) = 
u(y) = (4.1) 
Coefficient of variation = p-/? 


and we have considered first the distributions of y? and y* for which the mo- 
mental constants, in terms of p, come to be: 
For y? 


8(5p" +17p+ 1 (2 
+1)(2p+3) 
14 23 129 


3(1+— 


5 185 


For y'/? we find 

= (fa — pa + 2a*)*/($p — 

= (4p + tp? — 2a? + pat — — a)? 
where 


a = + $p)/T (4p). 


In Fig. 3 some specimen values of these transformations are drawn and super- 
imposed on them are some contours for c, the coefficient of variation of the 
transformed variable. Thus in order that a particular transformation should 
work well the variable ought to have not only the appropriate value of 6; and 2 
but a coefficient of variation equal to c. 


5. TRANSFORMATION OF x? 


Of course a Type III or x? variate may be transformed into a normal variable 
and it is of interest to examine the Wilson-Hilferty proposal, put forward in 
[9], from this point of view. Wilson and Hilferty took (x?/v)* to be normally 
distributed with a mean 1—2/9 and standard deviation +/(2/9v) where » is 
the number of degrees of freedom. Now we know that for x? the following values 
hold 


mw’ =» w= 2 =8/y Be =3+12/r. 


65 
2 2 
(4.2) 
For 
8 (1+) 
— + 
4p? 
Pp p Pp (4.3) 
110 708 
=3+—+—. 
p p 


AMERICAN STATISTICAL ASSOCIATION JOURNAL, JUNE 1957 


02 os 


Transformations cf a Type IT 
3 


Taking some specimen values of v we give in Table 2 the (8, 62) of the x? dis- 
tribution together with the coefficient of variation, V, of (x?/v)'/*. Against them 
are the (61, 82) of the cube root transformation nearest to the observed point in 


TABLE 2 
ACCURACY OF WILSON-HILFERTY TRANSFORMATION 


x? distribution Cube root transformation 


Bs Bo 


Sassss 
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’ 38 
40 
Bi 
0.4 3.6 0.011 0.388 3.613 0.011 
0.267 3.4 0.007 0.259 3.412 0.007 
0.2 3.3 0.006 0.194 3.312 0.006 
0.16 3.24 0.004 0.155 3.252 0.004 
0.107 3.16 0.003 0.104 3.172 0.003 
| 0.08 3.12 0.002 0.078 3.131 0.002 
| 
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a (61, 82) diagram together with the coefficient of variation, V’. We notice that 
the coefficients of variation are in exact agreement which demonstrates that 
the transformation should be a good one and this is in fact known to be the 
case. 


6. EXAMPLE ON RANGE 


We will now illustrate by means of two examples how the knowledge given 
to us by Fig. 1 may suggest the correct transformations, or at any rate suitable 
transformations, to bring various distributions into approximate normality. 
First we will consider the range in random samples of size n from a normal 
population. The form of the distribution varies enormously with the value of 
n and the values of 6; and 6 for n up to 20 have been given [5]. Some specimen 
values from that paper are given in Table 3 and if these are plotted in a (61, B:) 


TABLE 3 
MOMENTAL RATIOS FOR DISTRIBUTION OF RANGE 


3 5 7 10 13 16 19 


Bi 0.4176 0.2174 0.1742 0.1580 0.1559 0.1576 0.1612 
Bs 3.286 3.169 3.174 3.200 3.220 3.237 3.254 


diagram it can be seen that no one method of approximation is likely to be of 
general use. Previous attempts at approximating have used the x? distribution, 
see [2], or the x distribution, see [6], whilst Pearson [7] has shown how, 
broadly speaking, the former is betier for n greater than 10 whilst the latter is 
better for n less than 10. A similar situation holds for the type of transformation 
that we are considering here. Range is essentially positive and a comparison of 
its momental constants with Fig. 1 shows that, for low values of n, w*/* would 
be approximately normal whilst for larger values of n, w'/* would be approxi- 
mately normal. Since the commoner uses of range, due to its efficiency declining 
with increasing n, are for small n, we will consider small values in some detail 
taking 2<n<10. To obtain the appropriate normal distribution the expression 


2/3 
wt = ==) where § = E(w) 


was used to estimate the first two moments of w*/* and hence get the probability 
integral for w. This has been tabulated in Table 4 for the two cases of n equal 
to 5 and 10 and w going up in steps of 0.2. From this table we find that the 
maximum error occurs in each case just below the median value and is equal 
to 0.0071 when n is 5 and 0.0093 when n is 10. Approximate percentage points 
for range can now be found by assuming w*/* to be a normal variable and using 
the known percentage points of a normal distribution. The true probability 
corresponding to such percentage points can be found from the tables of the 
probability integral of range given by Pearson and Hartley [8] and reproduced 
in Biometrika Tables for Statisticians. The true probabilities corresponding to 
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TABLE 4 


COMPARISON OF TRUE AND APPROXIMATE CUMULATIVE 
PROBABILITIES FOR RANGE 


n=5 n=10 n=5 n=10 
w w 

Approx. True | Approx. True Approx. True | Approx. True 

0.2 | 0.0008 0.0001 _ _ 3.2 | 0.8445 0.8429 | 0.5777 0.5864 

0.4 | 0.0034 0.0014 — -— 3.4 | 0.8884 0.8859 | 0.6704 0.6769 

; 0.6 | 0.0102 0.0068 | 0.0001 _ 3.6 | 0.9218 0.9192 | 0.7521 0.7558 
0.8 | 0.0241 0.0200 | 0.0004 0.0001 | 3.8 | 0.9468 0.9441 | 0.8205 0.8212 
: 1.0 | 0.0484 0.0450 | 0.0013 0.0005 | 4.0 | 0.9647 0.9623 | 0.8746 0.8731 
1.2 | 0.0860 0.0847 | 0.0040 0.0022 | 4.2 | 0.9772 0.9752 | 0.9158 0.9126 

1.4 | 0.1386 0.1400 | 0.0102 0.0071 | 4.4 | 0.9856 0.9840 | 0.9453 0.9416 

1.6 | 0.2058 0.2101 | 0.0227 0.0184 | 4.6 | 0.9911 0.9899 | 0.9658 0.9620 

1.8 | 0.2858 0.2920 | 0.0449 0.0403 | 4.8 | 0.9946 0.9938 | 0.9793 0.9759 

2.0 | 0.3745 0.3816 | 0.0805 0.0768 | 5.0 | 0.9968 0.9963 | 0.9879 0.9851 

2.2 | 0.4672 0.4739 | 0.1321 0.1307 | 5.2 | 0.9982 0.9978 | 0.9932 0.9911 

2.4 | 0.5590 0.5643 | 0.2003 0.2025 | 5.4 | 0.9990 0.9987 | 0.9963 0.9948 

2.6 | 0.6452 0.6487 | 0.2838 0.2893 | 5.6 | 0.9994 0.9993 | 0.9980 0.9970 

2.8 | 0.7228 0.7242 | 0.3783 0.3867 | 5.8 | 0.9997 0.9996 | 0.9990 0.9983 

3.0 | 0.7895 0.7891 | 0.4785 0.4878 | 6.0 | 0.9998 0.9998 | 0.9995 0.9991 


the percentage points found by the normality assumption are given in Table 5 
for the cases where n is 5 and 10. Although the errors are slightly greater for the 
upper than for the lower tails, they are not of very great magnitude. One re- 
sult of great utility that flows from using such a transformation is that it 
enables us to combine ranges from various sizes of sample. This follows since 
the distribution of a linear function of a number of independent normally 
distributed variables is itself normally distributed. 


TABLE 5 
COMPARISON OF EXACT AND NOMINAL PROBABILITIES FOR RANGE 


True probability 
Nominal 
probability n=5 n=10 
level 
Upper tail Lower tail Upper tail Lower tail 

: 0.10 0.103 0.098 0.103 0.096 
i 0.05 0.053 0.046 0.054 0.044 

0.025 0.027 0.021 0.028 0.022 

0.01 0.011 0.007 0.017 0.007 


’ 7. DISTRIBUTION OF MEAN DEVIATION 


As a second example we consider the distribution of the mean deviation in 
samples from a normal population. This distribution has an awkward form to 
handle mathematically but from the values of 6, and f given in Table 6 it 
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TABLE 6 | 
MOMENTAL RATIOS FOR DISTRIBUTION OF MEAN DEVIATION 


n 5 7 10 13 16 19 


Bi 0.230 0.157 0.106 0.081 
Ba ° 3.197 3.136 3.093 3.070 3.057 


would seem that except for very low values of n the two thirds root should be 
an appropriate transformation to use. In Table 7 a comparison is made as for 
the previous example on range giving the exact probability found from a table 
of the probability integral of mean deviation due to Godwin [3] that corre- 
sponds to the nominal percentage point of mean deviation found by assuming 
the two thirds power is normally distributed. The agreement between the 
nominal and true levels seems to be very good thus suggesting that little error 
would be made in taking (mean deviation)** as a normal variable. 


TABLE 7 


COMPARISON OF NOMINAL AND TRUE PROBABILITIES 
FOR MEAN DEVIATION 


True probability 


Nominal 
probability 
level 


8. OTHER APPLICATIONS 


{any other distributions may be dealt with in a similar fashion. Thus it is 
known that to stabilize the variance of a Poisson variable it is best to take 
(2+#)? where z is the Poisson variable. Following the procedures outlined 
above we are led to using (r+}4)*/* as the best transformation to convert the 
variable into normality and calculations show that even for the Poisson pa- 
rameter as low as 2 good results are obtained. The method of attack is a power- 
ful one in general and for good results to be obtained it is essential not merely 
to relate the 6; and £ of the original variable to the correct fractional power but 
also to ensure that the variable has a suitable coefficient of variation, adding a 
constant to the original variable if necessary. In attacking such distributions 
as the Poisson where the variable is discontinuous an improvement can be 
made by considering the addition of a rectangular variable, that is by taking 


0.054 
3.047 
Upper tail Lower tail Upper tail Lower tail 
0.10 0.103 0.099 0.102 0.099 
0.05 0.050 0.050 0.052 0.048 
0.025 0.028 0.021 0.027 0.023 
0.01 0.0i1 0.006 0.011 0.008 
0.005 0.006 0.002 0.006 0.004 
0.001 0.000 0.001 0.001 0.001 
| 
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y = (x (8.1) 


where z is the Poisson variable with parameter \, c is a constant and z a rec- 
tangular variable distributed from —4 to +4. By finding the moments of y and 
best value thus leading us to take 


= (x + 2)" (8.2) 
where 
p(z) = 1 


The moments of Y can be accurately computed using a table of Poisson prob- 
abilities and we find, for example, that the skewness of Y approaches zero very 
rapidly indeed some specimen figures being 


» 0.5 1.0 


B,(Y) 0.241 
Bi 2 


Further the actual probability distributions have been compared for the case 
of A equal to two by finding 


Poisson proba- 
bility 0.01 0.05 0.1 0.3 0.5 ‘ 0.9 ‘ 0.99 


Normal approx.| 0.014 0.043 0.096 0.294 0.500 0.696 0.904 0.949 0.991 


X =z+z for various probabilities and then taking Y =z? as being normal 
finding the corresponding normal probability. There is very good agreement 
here even though the value of \ is very small. 
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LINEAR TRANSFORMATION TO A SET OF STOCHASTICALLY 
DEPENDENT NORMAL VARIABLES 


J. Moonan 
U. 8. Naval Personnel Research Field Activity, San Diego* 


It is the purpose of this paper to present a method which will enable 
the construction of a set of stochastically dependent normal variables 
from a set of independent normal variables. 


1. INTRODUCTION AND STATEMENT OF THE PROBLEM 


ao [4, App. B of Chap. 8] has shown two ways to derive a linear transfor- 

mation from a stochastically dependent set to an independent set of 
variables. These procedures have great utility for multivariate analysis prob- 
lems. Occasionally it is of interest, however, to transform the other way, that is 
from an independent set to a dependent set. Such transformations could be 
used to facilitate students’ understanding of multivariate statistical theory, 
particularly in the areas of discriminant, component, and factor analysis, by 
means of sampling experiments. In fact, in more general applications of Monte 
Carlo methods to numerical as well as statistical analysis, data developed 
through such transformations would be very useful. 

Formulas which can be used for developing a set of two Gaussian variables 
which are correlated to any degree are given in the Rand tables [3], and 
Fieller et al. [1] provide a table of nine bivariate-normal samples of size 3000 
which are samples from populations with correlations of .1, .2, .3, - - - , .9. The 
more general problem of generating random samples of any size from a normal 
population of any number of variables is nct so well known. This is the problem 
considered in this note. 

We consider a vector set X’=[zx(1), 2(2),---, 2(V)] of V stochastically 
independent variables with 


E[z(v)] =0,0=1,---,V and E[XX’]) =I (1) 
where / is an identity matrix of order VX V. We desire to transform X, via the 
linear mapping 

Z= AX (2) 
where Z’ = [z(1), 2(2),---, 2(V)] and A is a lower-triangular, nonsingular 
matrix of order VX V. Thus A=a(ij); i, 7=1, ---, V with all a(j), for 
equal to zero. We also require the 

E[zv)] =0; v=1,---,V and 

E[ZZ'| = E[AXX’A’'] = AE[XX’']A’ = AIA’ = M 
where M =m(ij) is some specified VV nonsingular symmetrical variance- 
covariance matrix. In a particular case, M might be a product-moment correla- 


* The opinions expressed are sole’v those of the author and are in no way official; nor are they to be construed 
as representing those of the U. 8. Naval Personnel Research Field Activity or Bureau of Naval Personnel. 
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tion matrix with elements p(v, v’) where, when v=v’, p(v, v’) =1. Otherwise 
p(v, v’) is the product-moment correlation between variables v and v’. 


2. THE SOLUTION 


The problem is essentially one of reducing a symmetric matrix into the 
product of a triangular matrix, A, and its transpose. Numerical analysts refer 
to this problem as Choleski factorization. Due to the triangular character of 
A, it is very simple to determine its elements, corresponding to a particular M, 
by elementary transformations applied on the rows of M. In general, 


min(i,7) 


M = m(ij) = AA’ = p> a(vija(yj) ij=1,---,V (4) 


where min (i, j) is the minimum of the numbers i and j which correspond to 
designators of the rows and columns of AA’. By successive elementary trans- 
formations, (4) is reduced to 


A = = =1,---,V (5) 


where, for i>j, all elements of A are zero. In other words, AA’ is reduced to 
the upper triangular matrix, A. If the elements in the ith row of A are divided 
by a(ii), we obtain A’. 
For 1<j, (4) can be written as 
i-1 


m(ij) = a(vi)a(vj) + a(ii)a(ij) (6) 
pas] 


whereby an explicit solution for the a(ij) can be obtained, for a(iz) >0, through 
the equations, 


= [ mi -2 | fold). 


Some special cases of (7) give 
a(11) = /m(11) > 0 
a(1j) = m(1j)/a(11) 
a(22) = /m(22) — a%(12) >0 
a(2j) = [m(2j) — a(12)a(1j) ]/a(22) 
a(33) = »/m(33) — a%(13) — a*(23) > 0 
a(3j) = [m(3j) — a(13)a(3j) — a(23)a(37) ]/a(33). 


For a particular M the calculations required by (8) lend themselves especially 
well to systematic solution by means of the Square Root Method. An illustra- 
tion of this is provided in section 3. Before coming to that, however, it will 
now be shown that the z(v) are, in fact, multivariate normal variables. 

The density function of the z(v), — © <2(v)< ~, is 


| 
| 


TRANSFORMATION TO DEPENDENT NORMAL VARIABLES 249 
P[z(1), 2(2),++-,2(V)] = exp [—4X'X] (9) 


where the quadratic form in the exponent of (9) is positive definite. 
Since X = 


X'X = AZ = Z'(AA')'Z = (10) 


The Jacobian of the transformation (2) is the absolute value of | A’-*|, there- 
fore the density function of the 2(v), — © <2(v)< @, is 


Ple(1), 2(2), ++, 2(V)] = exp [-32’M'Z] (11) 
where || A’—"|| specifies the absolute value of | A’-'|. However, 
_ if we take the positive square root. Consequently (11) becomes 
P[e(1), 2(2),- ++, 2(V)] = exp [-42’M-1Z] (13) 


which is the multivariate normal density function of the z2(v) whose expectations 
are zero and variance-covariance matrix is equal to M. 


3. AN EXAMPLE 
Consider the special case where V =3, then 
a(il) a(12) a(13) 
A’'=/0 a(22) (23) 
0 0 a(33) 
and AA’ is symmetrical and of the form 
a*(11) a(11)4@(12) a(il)a(13) 


a(11)a(12) > a*(v2) > a(v2)a(v3) 
. 


 a(11)a(18) a(v2)a(08) 


By multiplying the elements in the first row of (15) by a(12)/a(11), and sub- 
tracting these products from corresponding elements in the second row and 
also multiplying the elements in the first row of (15) by a(13)/a(11) and sub- 
tracting these products from the corresponding elements in the third row, we 
obtain, 

a*%(11) a(il)a(12) a(11)a(13) ) 

0 a*(22) a(22)a(23) (16) 

0 a(22)a(23) a*(23) + a*(33) 
Similarly, if we multiply the elements in the second row of (16) by a(23)/a(22) 


and subtract these products from the ere elements in the third row 
of (16) we obtain, 


| (14) 
AA' = (15) 
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a*(11) a(ilja(12) a(11)a(13) 
A = a(ij) = |0 a*(22) a(22)a(23) | . (17) 
0 0 a*(33) 
Now if we divide each element in each row of (17) by the square root of the 
leading nonzero element in the row, then we obtain A’ which provides the 
basis for the linear transformation Z’= X’A’ or Z= AX. 
Consider a numerical example where, 
8.0 2.0 3.0 
M=/|2.0 2.0 1.2]. | (18) 
3.0 1.2 2.0) 
Thus we desire to construct a set of variables which will represent a random 
sample from a three-variate normal population for which the variance of z(1) 


is 8.0, the covariance of 2(1) and 2(2) is 2.0, etc. The calculations used to obtain 
the elements of A’ b, the square-root method are given in Table 1. The first 


TABLE 1 


THE SQUARE-ROOT PROCEDURE FOR OBTAINING A’ BY ELEMENTARY 
TRANSFORMATIONS ON M 


Columns 


Rows Instructions 
C(1) C(2) 


R (1) 
R (2) 
R (3) 


R (4) 
R (5) R(2) —[R(2)C()JR(4) 
R (6) 


(Vv) 
(Vv) 
(Vv) 


(v) 
(Vv) 
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R (7) R(S) +(R(5)C(2)] 
R (8) R(6)—[R(6)C(2)}R(7) 


R(10) R(5) +V[R(5)C(2)) 
R(11) -R(8) + V[R(8)C(3)] 


0 0 (Vv) 


* This value need not be calculated since it is equivalent to the number in [R(5)C(3)], by symmetry. 


three rows of Table 1 present M together with a check column found by 
summing the elements of each row of M. To obtain row 4, i.e., R(4), we divide 
the elements in row 1 by the element which occurs at the intersection of row 1 
and column 1, i.e. [R(1)C(1)]. Row 4 is called the “first pivotal row.” To 
obtain the entries in row 5, we subtract the products of the element occurring 
in row 2 and column 1, ie., [R(2)C(1)]=2.0, and the elements in row 4 from 
the corresponding elements in row 2. This process, which is known as “sweep- 
out,” is continued to obtain row 6. This completes the first set of calculations. 


R (9) io .707 (Y) 
A’ 


| 
| 
C(3) 
8.000 3.000 13.000 
M}2.000 1.200 5.200 
3.000 2.000 6.200 
1.000 .375 1.625 
0 | -450 1.950 
0 * (875 1.825 
| >. 
0 740 
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The check marks (/) in the check column of Table 1 signify that the calcula- 
tions agree for both “Between” and “Within” rows, i.e., 6.2 — (3.0) 1.625 = 1.325 
agrees with .450+.875 = 1.325. 

In the next set of calculations given in Table 1, row 4 is ignored and we con- 
struct a new pivotal row, row 7, by dividing row 5 by [R(5)C(2) | =1.500. Now 
the sweep-out process is started on the second column. We need only to sweep- 
out the .450 from row 6 in this section. This gives us row 8. If we were to 
construct the next pivotal row, it would be obtained by dividing the elements 
in row 8 by .740. 

In the last section of Table 1, rows 9, 10 and 11, we divide all the elements of 
the rows used to obtain pivotal rows (rows 1, 5, and 8) by the square root of the 
leading nonzero element of those rows and record the quotients in rows 9, 10 
and 11, and columns 1, 2 and 3. The numbers in those rows and columns 
constitute the elements of A’. Therefore the equations given in (19) correspond- 
ing to Z=AX, when used with a table of random normal deviates with mean 
zero and unit variance, can be used to generate a set of variables with variance- 
covariance properties given by (18). 


2(1) = 2.8282(1) 
a(2) = .707x(1) + 1.2252(2) (19) 
2(3) = 1.0612(1) + .3672(2) + .8602(3) 


Tables of independent random normal variables may be found in various 
places [2, 3, 5, 6]. For example, from the first page of the Rand table of normal 
deviates [3], we might select the first column on the page to represent a set of 
x(1), the second column to represent a set of z(2) and the third column to repre- 
sent a set of z(3). Then using the formulas given by (19) we could obtain a 
random sample of size N = 50. The data might be arranged as in Table 2. It is 
easy to program IBM machines to make the transformations electronically 
instead of doing it mechanically on desk calculators and, indeed, to do the work 
shown in Table 1. This is advisable if V or N is large. The mean of any of the 
z(v) can be changed by adding a suitable constant to each of the elements in 
the z(v)th column of Table 2. This is equivalent to adding a suitable constant 
to each of the equations of (19). 


TABLE 2 


PRESENTATION OF A SET OF z(v) AND THEIR TRANSFORMATION 
TO 2(v) FOR A SAMPLE SIZE OF 50 


Variable 
a(1) 2(3) 2(1) z(2) 2(3) 

Unit 
0000 —-1.276 -—1.218 — .453 —3.609 -—2.394 -—2.190 
— .318 — .799 —1.664 — .899 -—1.204 —2.062 


-498 1.072 1.587 1.408 1.665 2.269 
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An important special case of this type of analysis occurs when V =2 and we 
wish the two variables to be correlated to the degree p with each variable 
having zero mean and unit variance. In such a case, a(11) =1, a(21) =p and 


a(22) = +/(1—p*) so that 


2(1) = 2(1) and 2(2) = px(1) + — p*)2(2). (20) 


This particular result, as previously mentioned, is also given in the Rand 
tables [3, p. xxiv]. 
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THE REGIONS OF UNIMODALITY AND POSITIVITY IN THE 
ABBREVIATED EDGEWORTH AND GRAM-CHARLIER SERIES 


D, Bernpt 
U. S. Weather Bureau* 


0. On the notation. In the literature it is common to use u and o to denote, 
respectively, the population mean and standard deviation and so these are used 
here. Since there is more than one symbol prevalent these days for the coef- 
ficient of skewness it is well to state at the outset that in this paper a; is used 
to denote this parameter. In practice, these population parameters are usually 
unknown and estimated from the sample. 

1. Introductory remarks. The methods of fitting a curve to an observed fre- 
quency distribution when the mathematical form of the frequency function is 
unknown are considerable in number. Many of these methods are discussed at 
length in [1, 2, 3, and 4]. The statistician faced with the problem of fitting a 
curve without much a priori knowledge is familiar with at least some of these 
various methods and with the fact that each method has its disadvantages and 
restrictions. 

Occasionally, it is appropriate and convenient to use the Edgeworth or 
Gram-Charlier series (see, e.g., [1]) as an approximate theoretical model in 
computing ordinates and thus estimating frequencies of an unknown frequency 
distribution. As is well known, the use of these series involves the population 
moments or, more often, estimates of the population moments obtained from 
the sample. In practical application, it is usual to make use of the first four 
moments, from which the mean, variance, skewness, and kurtosis, or their 
estimates are obtained, and apply these in the first three terms of the series in 
developing the ordinates of a frequency curve. But because the Edgeworth and 
Gram-Charlier curves are multimodal and/or produce negative values of the 
ordinate under certain conditions of skewness and kurtosis, their use is not as 
general as one might prefer. Barton and Dennis [5] have recently derived the 
conditions on the skewness and kurtosis under which these two curves are 
multimodal and positive definite. 

It happens not infrequently, however, that in practical application an esti- 
mate of the fourth moment (and therefore the kurtosis) is not available, or for 
reasons of economy or expediency is not calculated, and it is desired then to 
use only the first two terms of the Edgeworth series (which are also the first 
two terms of the Gram-Charlier series), herein referred to as the abbreviated 
series, in estimating the ordinates or theoretical frequencies from the data cf 
the sample. One can also envisage circumstances in which the statistician 
contemplating the use of these series has reason to believe that his data are 
reasonably mesokurtic in distribution and that his chief problem is that of 
correcting for skewness. In this event again, it might be concluded that the 
abbreviated series is, for practical purposes, as useful as the series would be if 
more terms were retained.' 


* Now cn assignment with the President’s Advisory Committee on Weather Control. 
1 There is a discussion of the order of magnitude of the terms of this series in [1]. 
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Using the abbreviated series for these reasons, or others, in approximating 
an unknown frequency function has the restriction that now, essentially as- 
suming mesokurtosis, negative ordinates become certain over some range of 
the standard variate t=(z—,)/o unless ag is exactly zero. This is apparent 
from the results of Barton and Dennis [5]. Knowing this, the inclination of the 
statistician might be to avoid the abbreviated series in the very circumstances 
that otherwise seem most natural for its application. Thus the question still 
exists: Over what range of t, given as, is the ordinate of the approximation 
given by the abbreviated series always positive and the curve always unimodal? 

Knowing the regions in the t, as-plane where the curves become multimodal 
and/or where the negative ordinates occur is a first consideration in using the 
abbreviated series. It is the purpose of this paper to determine these regions 
mathematically, over the useful range —3.5<t<3.5, —4.5<a;<4.5, and to 
portray them in a convenient, graphic form. 

2. Mathematical interpretation of the abbreviated series. When the abbreviated 
series is used, the ordinate of the frequency curve is given by 


= — (23/316) (1) 


where 
= (2x) exp 
and ¢®(¢) is its third derivative, or 
= 


The maxima and minima of f(t) are first examined for regions of multi- 
modality. This is accomplished by setting the derivative 


=0 (2) 


and solving for a; when given values are substituted for ¢. The resulting solu- 
tions represent, then, the roots of ¢ in the biquadratic when a; is specified. 
These roots are shown in Fig. 1. 

The condition for a positive ordinate is f(t) >0, and when this inequality is 
solved in terms of ¢ and a; we have 


6(t)/o(t) > aa for t< — O<t < (3) 
and 
6O(t)/o(t) < ay for > (3)", —(3)¥*<t<0 (4) 


which represent the outer boundaries for positive ordinates. These outer bound- 
aries are also shown in Fig. 1. | 

3. Interpretation. In the solutions of (2) shown in Fig. 1, it is seen that the 
regions of maxima and minima are located symmetrically about the origin, 
as is to be expected by the nature of the equation. Likewise, in the solutions of 
(3) and (4) the regions of negative (positive) skewness are symmetrical with 
respect to the origin. 
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For a range of approximately | a3;| <1.6480, the curves are unimodal with 
maximum ordinate at ¢$0 according as as320 and the value of the minimum 
ordinate, which occurs on the side of the mode opposite the “tail,” negative 
for finite ¢. 

For values of |a3| >1.6480 approximately, the solutions of (2) have four 
real roots and there are two maxima and two minima for the curves over the 
range of t. The secondary maximum of each curve occurs on the “tail” side of 
the mode. 

All curves have a negative minimum on the side of the mode opposite the 
“tail.” All multimodal curves have a minimum on the “tail” side of the mode, 
but this minimum ordinate is not negative except when | a;| >3.0. 

4. Summary and conclusion. The regions of unimodality and positivity in the 
abbreviated Edgeworth and Gram-Charlier series are developed and presented 
in a convenient form. Fer analysts using or contemplating using the abbreviated 
series to approximate unknown frequency distributions, it will be useful to 
consult Fig. 1 to determine the range of t over which the approximations will 
be meaningful for given as. 
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THE PRESIDENT’S ECONOMIC REPORT* 


RussELL 
Prudential Insurance Company of America 


Ix January of each year the President transmits to the Congress three im- 
portant documents—the State of the Union message, the Budget, and the 
Economic Report. The Economic Report, the newest of these documents, is pro- 
vided for under the Employment Act of 1946. The President’s Economic Report 
has become a document of great significance to the American people and of 
special interest to students of the American economy. The Report of January 
1957 offers a thorough analysis of the forces operating in the economy and a 
discussion of population changes and prospects. The Report also sets forth the 
economic philosophy of the Administration and the policy recommendations it 
believes should be followed in order to ensure a strong and growing economy. 
Of special interest to statisticians is the wealth of data (covering almost 100 
pages) relating to prices, production, employment, income, and a multitude of 
other measures of the United States economy. 

This paper provides, first, a general review of the Report’s highlights and, 
secondly, a discussion of the areas in which the Report’s policy recommenda- 
tions seem weak or in need of amplification. 


GENERAL REVIEW 


The Report contains an excellent account of the forces operating in the 
American economy during the period 1953-1956. it traces the decline in inven- 
tory and defense expenditures in 1953 and 1954 and discusses how govern- 
mental policies contributed significantly to the stability of incomes and total 
demand. It discusses the gathering forces of recovery from 1954 into 1955 and 
development of the subsequent prosperity which in 1956 reached “boom” pro- 
portions. 

There is a careful and detailed analysis of the course of the economy in 1956 
with special attention being paid to major sources of strength, sectors under- 
going adjustment, and cost and price trends. The Report highlights the record 
high business outlays for production facilities, rising employment, and rising 
consumer incomes and spending. It emphasizes the role of Federal price sup- 
ports in sustaining and helping to reverse the prolonged decline in farm income. 
Despite price supports and the Soil Bank Program, the basic problem of agri- 
culture, that of adjusting the volume and composition of agricultural commodi- 
ties to the domestic and foreign demand, remains unsolved. Appropriately, 
considerable attention is devoted to a discussion of the causes for rising costs 
and prices during the year. All three major categories of wholesale prices rose 
and significant increases were registered in industrial prices. Consumer prices 
rose 2.9 per cent from December 1955 to December 1956. An unprecedented 
total demand for funds placed strong pressure on financial markets. Interest 


* An invited review article on the Economic Report of the President, January 1957 (Washington: United States 
Government Printing Office, 1957). Pp. xi, 200. $0.65. 
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rates rose sharply as demand pressed against the limited supply of funds. 
Because of the pressure upon resources and productive capacity, and a conse- 
quent rise in prices, it was essential that the Federal Reserve follow a monetary 
policy designed to prevent an undue expansion of credit. 

As to the outlook for 1957, the Report cautiously advances the view that 
there will be a continued high level of employment, production, and earnings. 
It then sets forth a broad range of policy recommendations, many in the form 
of legislative proposals to the Congress, in order to “promote maximum em- 
ployment, production, and purchasing power,” the goals of the Employment 
Act. The policy recommendations include such varied subjects as state and 
local debt limits, ways to promote thrift, improving competition, agricultural 
policies, economic relations with foreign countries, mortgage credit, voluntary 
health insurance, and atomic energy. 

The economic philosophy of the Report rests upon the mutual cooperation of 
government, business, and labor. To maintain a healthy and growing economy 
each must assume its particular responsibilities. 

The role of government is defined as that of preserving and strengthening the 
institutions of competitive enterprise. This can be accomplished (1) by pur- 
suing policies which encourage the spirit of enterprise and promote incentives 
to work, to save, and to invest, (2) by exercising a strict discipline over expendi- 
tures, (3) by strengthening competition, (4) by pursuing policies which facilitate 
adjustments to advancing technology, and (5) by pursuing policies that will 
help maintain high levels of production and employment and income, earned 
in dollars of stable purchasing power. In a free economy the role of government 
should be limited. Business and labor must therefore assume important eco- 
nomic responsibilities. 

Business management has the responsibility to help avoid economic im- 
balance and dislocation and much progress has been made in this area—to 
wit, the trend toward long range capital expansion programs and organizing 
operations with a view to greater stability of employment. Management and 
labor have the responsibility to improve competition and to promote new tech- 
nology. Management and labor also “have the responsibility to reach agree- 
ments on wages and other labor benefits that are fair to the rest of the com- 
munity as well as to those persons immediately involved.” 

Characteristically, the Report’s policy recommendations imply cautious 
moves rather than bold changes in policy. In some cases more vigorous action 
should be taken. It is appropriate to survey some major recommendations 
through which the Administration would implement its basic economic philoso- 
phy. The Administration’s fiscal policy is considered adequate even though in- 
flation has gained the ascendancy. The high level of government expenditures 
is defended on the grounds that they provide essential services. The program 
for strengthening competition is lauded; financial, administrative, and tax aids 
to small business must be continued and improved; tariff reductions should be 
gradual and the U. 8S. must continue to cooperate with international trade 
organizations. The Report moves courageously in the matter of Federal aid for 
education, motivated by the shortage of educational facilities and trained man- 
power. In agriculture, efforts to solve the perennial problem of surpluses con- 
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tinue, with chief reliance now being placed on the Soil Bank Program, appar- 
ently as a transitional measure. Plans are advanced for improving health 
through education and research, and for strengthening personal security 
through more adequate unemployment compensation and elimination of wage 
discrimination on account of sex. It is surprising to find in the Report no full 
scale analysis of the important consumption-s3 ving-investment relationship, a 
subject which in a time of inflationary pressures should be deserving of careful 
study. 

In a paper of this scope it is of course impossible to discuss the economic im- 
plications of all of the many policy recommendations contained in the Report. 
The policies advocated typically would promote, although in some cases only 
slowly, a freer and more competitive economy. With notably few exceptions, 
adoption of the policies recommended would help the economy move closer to 
the ideal of maximum employment and growth with stability in the purchasing 
power of money. There are, however, certain policy recommendations which, 
though not in direct coaflict with the Report’s philosophy, deserve greater 
emphasis, clarification, or discussion. There are three areas to which I should 
like to devote particular attention—fiscal policy, voluntary restraints by lead- 
ers of labor and management, and agricultural policy. 


FISCAL POLICY 


As the year 1956 closed, inflation was the nation’s number one economic 
problem. During 1956, price rises were sharp and widespread. Despite cour- 


ageous efforts of the monetary authorities in preventing an undue expansion of 
credit, inflationary forces gained the ascendancy. The Report correctly applauds 
the policy of general monetary restraint, which was strengthened as the boom 
progressed, and cautions of the dangers of inflation. In view of the Report’s 
justified concern for stability in the purchasing power of money, it is indeed 
surprising to find it content with a fiscal policy which is less than adequate to 
deal effectively with the problem of inflation. In a period of price inflation, 
monetary and fiscal policy must be employed to combat the forces making for 
rapidly rising prices. Fiscal policy should be used with the same vigorous de- 
termination as monetary policy. 

Against the background of an economy operating at near-capacity levels and 
with demand exceeding supply in many areas the President proposed a federal 
budget for fiscal 1958 of $71.8 billions, the largest for peace time in the history 
of the nation. Certainly the presentation of this budget against a background 
of mounting inflation has given rise to fears of a further erosion of the dollar. 
The economic incongruity of the record budget is emphasized by the Report’s 
exhortation to business and labor leaders to show restraint in matters of wage 
demands and price increases, by its policy declaration to restrain the forces of 
inflation by maintaining sound Government finances, and by its further declara- 
tion to promote economic growth while maintaining the purchasing power of the 
dollar. 

The Report states that: 


the financial affairs of Government should be so administered as to help stabilize 
the economy and to encourage sound growth. The principle of flexibility in fiscal 
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policy calls for relating the budget as far as feasible to economic conditions, helping 
to counteract inflationary or deflationary tendencies as the situation requires. 


Despite the foregoing policy declaration, the budget for the fiscal year starting 
July 1 apparently reveals a significant change in basic Administration policy. 
Four years ago the Administration’s avowed aim was to reduce Federal spend- 
ing and within two years it had effected a substantial reduction in the Federal 
budget. In contrast, expenditures in the proposed conventional budget are 
$7 billion greater than those of fiscal 1955. 

The Report takes satisfaction in a balanced cash budget and a moderate 
surplus anticipated for the 1958 fiscal year, even though slightly larger cash 
budgetary surpluses were ineffective in halting inflation during fiscal 1956 and 
1957. It is necessary at this point to consider the cash budget because it is the 
one which exerts the most influence on the level of business activity. Only the 
cash budget reveals some of the most dynamic factors in over-all Federal 
expenditures—such as outlays for the secondary mortgage operations of the 
Federal National Mortgage Association, the highway program, and the in- 
crease in social security trust fund benefits. 

The pressure on prices makes it abundantly clear that without a large surplus 
there is risk of further inflation. Compensatory fiscal policy requires that when 
the economy is operating at near-capacity levels, government expenditures 
should be reduced in order to help ease the pressure upon existing resources 
and therefore upon prices. Flexible fiscal policy requires that the surplus be sub- 
stantially larger than that anticipated in the President’s budget proposal. Fiscal 
theorists recognize that even a balanced budget can have inflationary effects. 
When, for example, government spending increases, and taxes are even raised, 
consumers do not necessarily cut their purchases by the same amount as their 
tax increase. Rising wages may more than offset the tax increase or consumers 
may cut savings in order to maintain their level of living. The result is that an 
increase in Federal spending, even though the budget is balanced, may lead to 
a net addition to total demand for goods and services and therefore be infla- 


tionary. 
VOLUNTARY RESTRAINTS BY LEADERS OF LABOR AND MANAGEMENT 


The President’s Report recommends that in this time of inflationary pressure, 
business and labor leaders should both exercise restraint in their price policies 
and wage agreements in the interests of preserving the purchasing power of 
money. This policy recommendation is in keeping with the objectives of the 
Employment Act. Although such a program is highly desirable from the stand- 
point of society, more concrete proposals such as an appropriate fiscal policy 
are also needed to deal effectively with the problem of inflation. To be sure, 
statesmanship on the part of business and labor leaders can occupy an im- 
portant place in a free market economy but it would be unwise to overestimate 
the effectiveness of voluntary restraints in a time of strong inflationary pres- 
sures. 

In reality, the problem of voluntary restraints becomes one of determining 
what wage-price policies would be consistent with the maintenance of a stable 
dollar. A guiding principle of course is that wage increases should not exceed 
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gains in productivity. This oversimplifies the problem, however, because in 
reality there is great variation in wage increases and productivity gains from 
industry to industry and within a single company over time. Furthermore, im- 
mediate application of the policy, assuming that it could be generally agreed 
upon, would call for widespread revision of existing wage contracts and pricing 
policies. 

Even though the proposal is highly desirable, widespread and equitable im- 
plementation of the polivy at the present time is hampered by the absence of 
workable machinery or standards which leaders of business and labor could use 
effectively in assuming responsibility for a stable price level. A prior condition 
for the effective implementation of voluntary restraint is availability of up-to- 
date statistics which will show the relationships between wages, productivity, 
prices, profits, and other kinds of income. 

Until the requisite data are available and agreement is obtained from labor 
and management on implementation of the program, public policy should seek 
to make the market mechanism work more efficiently. Competition should be 
vigorously maintained and strengthened in all markets and the government 
should strengthen its anti-inflationary fiscal policy. Economic history shows 
that if inflation is to be effectively dealt with, it must be fought on all fronts. 


AGRICULTURAL POLICY 


Despite continuous efforts to solve the problem of excessive agricultural sur- 
pluses, adjustment of farm production to domestic and foreign demand has 


not yet been achieved. The disposal overseas of surplus supplies at cut-rate 
prices has been helpful but has not solved the basic economic problem, nor 
could it be expected to do so. Severe acreage allotments have been only par- 
tially successful because of increases in yield per acre. Furthermore, the diver- 
sion of acreage to other crops has caused surplus conditions to spread. The Soil 
Bank Program, which pays farmers for not using the soil for crops, failed to 
reduce output importantly in 1956. The Report admits that if the Soil Bank 
Program is to be effective it will be in the year 1957. It is undoubtedly true that 
the effect of the program will diminish in 1958 and 1959 because of rising yields 
per acre on the land remaining in cultivation. 

The goal of the Administration has been to move away from high rigid price 
supports and toward a flexible price support program, with a view to bringing 
production more nearly into line with consumption. In addition to flexible price 
supports, the parity formula and its application is being examined by the 
Secretary of Agriculture with a view to its making a contribution to “a prosper- 
ous, balanced, and free agriculture.” 

It is undoubtedly too early to evaluate the effectiveness of the Soil Bank 
Program in reducing surplus production but to date it has had only mild 
success. Its effectiveness this year and next, furthermore, will be offset by the 
increase in yield per acre. It is well-known that when acreage restrictions are 
imposed, farmers tend to increase their output per acre through more intensive 
cultivation of the remaining acres. 

Despite years of effort and the expenditure of billions of public funds, the 
inescapable conclusion is that the economy is still faced with the major problem 
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of excessively high production in agriculture. What is more serious is that the 
Report apparently envisages no near-term solution to the problem. 

When prices are maintained at artificial levels, when acreage allotments are 
employed, and when large payments of public funds are made for keeping land 
idle, it is difficult to see how the farm problem can be resolved in terms of a 
“balanced, and free agriculture.” Rather, resources will continue to be em- 
ployed to produce food products in quantities in excess of that demanded in 
the market place. The economy will be denied the production which would flow 
from a free market allocation of resources, and consumers will continue to pay 
artificially high prices for those products affected by the agricultural price 
support program. 

The free market’s answer to the problem of agricultural surpluses is that 
excessive production would bring lewer prices and the lower farm incomes 
would in themselves provide for the necessary adjustments. Those farmers 
receiving low incomes relative to other occupations would be induced to move 
out of farming and into better paying occupations. The income of farmers 
remaining on the soil would rise until there was no incentive to move into 
another occupation. The resources employed in agriculture would thus be 
reallocated with an increase in the total output of the nation conforming in 
greater measure to the desires of consumers. 

In summary, one can state that despite the general cautiousness of the 
Report's policy recommendations and the apparent weakness of a few proposals, 
enactment of its major policy recommendations would undoubtedly result in 


moving the economy, although slowly in some cases, closer to the basic Ad- 
ministration objective of competitive resource allocation. 
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commodity groups is defined in accordance with 
the method of Neyman. Ivan M. Lez, University 
of California, Berkeley. 


Bartholomew, D. J., “Tests for randomness in a 
series of events when the alternative is a trend,” 
Journal of the Royal Statistical Society (B), 18 
(1956), 234-9. 


Let the total time of observatioa be unity and 
the events occur at times T;, i= 1, 2,- + n. The 
properties of Cox's test for randomness against a 
trend based on m= 2%T;/n are considered. If 
8,(T) =k/n for Tr<T< Tea, is the sample dis- 
tribution function then it is shown that 4—m 
= fj {S,(T)—T}dT, and the relation displayed 
between this test and those of Kolmogoroff and 
von Mises. For n=5 it is shown that Cox’s test 
is more powerful than the Kolmogoroff tests 
against an exponential alternative. E. 8. Pacs, 
University of Durham, England. 


Downton, F., “On limiting distributions in bulk 
service queues,” Journal of the Royal Statistical 
Society (B), 18 (1956), 265-74. 


A random arrival queue has a batch service of 
up to s people at each service epoch; the distri- 
bution of the service time is x? with even degrees 
of freedom. For the queue in equilibrium the lim- 
iting distribution of the ratio of waiting time to 
batch size as 8 (i.e. the batch size) tends to in- 
finity is derived; the moments are given to- 
gether with those of the queue size. The simi- 
larities between the limiting behavior of the 
batch service queue and of a single server queue 
are discussed. E. 8. Paar, University of Durham, 
England. 


Finney, D. J., “Response curves and the 
of experiments,” The Indian Journal of Agri- 
cultural Science, 23 (1953), 167-86. 


What levels of treatment should be chosen 
when an experiment is to deal with a few levels of 
one or more quantitative treatments, and when 
sufficient rough pricr information about the re- 
sponse curve or surface is available to permit 
fairly explicit consideration of economic factors? 
This question is discussed in connection with 
fertilizer treatments as they effect crop yields. 
Some major points are: (1) the economically 
optimum level of a fertilizer treatment is usually 


far below the level giving maximum yield, be- 
cause of fertilizer cost and generally decreasing 
marginal yield as fertilizer level increases; (2) 
the levels used in an experiment, if any rough 
knowledge of the response curve is available, 
should be approximately centered on the roughly 
guessed optimal economic level, and should usu- 
ally be three or four in number; (3) the use of 
other than actual yield as the response variable, 
e.g. yield divided by yield at zero level or yield 
per pound fertilizer per acre, is sharply criticized; 
(4) it is usually important to include the zero 
level treatment. A particularly interesting sec- 
tion deals with levels for two fertilizers, simul- 
taneously applied. In addition to fertilizer levels, 
discussion of plant spacing levels (for which there 
is no absolute zero) is incl . Wiituiam Krvs- 
KAL, University of Chicago. 


Hirsch, Werner Z., “Firm progress ratios,” Econ- 
ometrica, 24 (1956), 136-43. 


Empirical progress ratios are presented based 
on linear logarithmic progress functions fitted to 
labor input and output data of a single large man- 
ufacturer for eight different products in the pe- 
riod 1946-1951. In addition to over-all progress 
ratios, separate ratios for assembly work and 
machining work for seven of the products are 
presented and the implications of their compara- 
tive magnitudes briefly discussed. Finally, ratios 
for two related products are presented and re- 
garded as providing suggestive evidence on the 
effect of compounded experience on the 
eters of the progress function. Ivan M. Leg, Uni- 
versity of California, Berkeley. 


Miller, Irwin, and Freund, John E., “Expected 
arc length of a Gaussian process on a finite inter- 
val,” Journal of the Royal Statistical Society (B), 
18 (1956), 257-8. 


The expected arc length of a stationary Gaus- 
sian process with autocorrelation p(r) on the in- 
terval (0, T) is found as a function of ae: 
E. 8S. Paan, University of Durham, E. 


Teicher, Henry, “Identification of a certain 
structure,” Econometrica, 24 (1956), 
172-7. 


Conditions are derived for identifiability of the 
stochastic difference structure X;= U;+aUj;_., 
U; independently and identically distributed. Al- 
though the U; are independent, successive pairs 
of observables X; are not independent. It is 
shown that a sufficient condition for the identifi- 
ability of a is that the distribution of U (or X) be 
nonnormal. Frvther, for both normal and non- 
normal X, conditions on @ are derived for the 
identifiability of both a and the distribution of U. 
Finally, with reference to the linear structure 
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connecting observable random variables X to 
the nonobservable random variable Y¥; in Xiu 
=¥,+ Vix, Xe=at+8¥it+ Va; in which random 
“errors” V; and Vz: may be dependent but not 
necessarily normal and EV,= EV2=0 or median 
Vi=median V:=0, it is shown that a sufficient 
condition for the identifiability of 8 is that the 
distribution of Y; not be a factor of any linear 
combination of V; and V2. Ivan M. Lzz, Uni- 
versity of California, Berkeley. 


Vajani, Luigi, “La distribuzione di x? ed una sua 
verifica sperimentale,” Rivista Italiana di Econ- 
omia Demografia e Statistica, 7(2-3-4) (1953), 
127-58. 

This article contains a detailed and elementary 
exposition of properties of the chi-square and in- 
complete Gamma distributions. The standard 
tests and confidence interval methods on popu- 
lation variance, when sampling from a normal 
distribution, are described. The most interesting 
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feature of the paper is its description of an ex- 
periment to examine the goodness of the chi- 
square approximation to the distribution of the 
chi-square statistic, in examining goodness of fit 
to a given multinomial. The given multinomial 
has ten cells with probabilities .05 (four cells), 
-10 (two cells), and .15 (four cells); sample size, 
that is multinomial n, is 100; and 100 such sam- 
ples are drawn. For each sample the traditional 
x? statistic is computed, and the resulting 100 
point empirical distribution is compared with the 
x? (9 d.f.) distribution. The comparisons are (a) 
via sample mean and variance (9.46 and 17.99 
respectively, against 9 and 18 for x? (9 d-f.)), 
(b) graphical, and (c) via carrying out a x? test 
on the observed values (P-value 0.877). This chi- 
square approximation appears generally reason- 
able, on this evidence, but sample size is too small 
to permit firm conclusions on the adequacy of the 
approximation in the tails. KrusKat, 
University of Chicago. 


BOOK REVIEWS 


Fundamental Statistics for Business and Economics. John Neter and William Wasserman. 
New York: Allyn and Bacon, Inc., 1956. Pp. xiv, 638. $6.50. 


Francis McIntyre, California Texas Oil Company, Lid. 


0 authors of this elementary text state that it is designed to provide an under- 
standing of the uses of statistical methods as “tools of decision making.” This 
phrase probably reflects the current popularity of operations research. Since the ap- 
proach of the text is described as “nonmathematical,” it may be questioned whether 
enough can be said about operations research at this level to justify a treatise. 

If, however, the text is regarded not as a work on decision making but simply as a 
straightforward elementary statistics book for business and economics students, this 
reviewer concludes the authors have done a good job. The book has been designed 
for use in a one-quarter or one-semester statistics course and does not presuppose any 
competence in the use of symbols beyond the high school freshman level. To shorten 
or simplify the treatment, portions of the text have been designated for possible omis- 
sion. Such “technical » tions” include those dealing with the extraction of square 
roots. In view of the number of high school students who manage these days to com- 
plete a reputable course in calculus before entering the university, it seems a pity that 
such an approach is necessary, but the author’s prefatory acknowledgment of in- 
debtedness to seven university professors and a private “business research manager” 
satisfy this reviewer that the authors have taken the steps necessary to assure that 
the level of the text is appropriate to the intended audience. 

The newness of forecasting as a science is rather drastically overstated: “Until 
quite recently, business and economic statistics were collected primarily for purposes 
of historical analysis” (p. 3). 

At the risk of revealing the reviewer's bias he must comment on what seems to him 
the imbalance, in a discussion of data sources, of devoting six pages to U. S. sources 
and only one-quarter page to the rest of the world. There seems to be no mention 
whatever of sources of foreign data other than the international organizations. How 
about such sources as The Economist, The Financial Times, the quarterly publications 
of central and private banks abroad, etc., etc.? 

The text contains a good, brief discussion of electronic data processing but scant 
acknowledgment of the existence of punched card firms other than IBM. The dis- 
cussion of errors, and of rounding and significant digitr seems first-rate. (True, the 
inference is given that the quotient of two numbers, «ach of n significant digits, will 
itself contain n significant digits, but these rules are described as “approximate.”) 

Chapter V (Graphics) is particularly good. It is accordingly unfortunate that Figure 
5.33 (a), which is presented as “simple and effective,” is captioned “Total Ship- 
ments,” while the X axis is identified as “Actual Sales.” The chart is in two segments 
(evidently a quarterly and a monthly presentation, although this fact is not particu- 
larly well-marked in the labeling) using a common vertical scale. Hence, quarterly 
sales for 1949 are grouped alongside monthly sales for 1950-51 without comment 
or explanation. Since the authors clearly demonstrate a real appreciation of charting 
problems, it is unfortunate that they have chosen from a commercial source a chart 
which is not up to their own standards. 

In the treatment of elementary concepts such as standard deviation, formulas are 
shown and applied with the terms and symbols of the formulas being defined on a 
following page. This may be a conscious effort to ease the pain for the so-called non- 
mathematical student, but the reviewer questions whether it is good for him. 
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The chapter ends are replete with excellent problems for use in class and homework 
assignments. 

The text gives a delightful description of taste tests in blended whiskey as an 
example of the binomial probability distribution (p. 249-50). 

There is an interesting discussion of the Poisson distribution—all in terms of appli- 
cations when it may be assumed that the Poisson distribution is applicable! Very little 
is said as to testing this assumption (pp. 252 ff.). 

While the sampling chapter recognizes by frequent comment the existence of non- 
sampling errors, the tone of the chapter suggests that since these do not lend them- 
selves to precise calculations as to the standard errors, or means, we should not worry 
too much about them. 

The explanation of the confidence interval concept (p. 316) is really quite satis- 
factory. 

Chapter IX contains some interesting case studies in sampling. The authors find it 
necessary, however, to leave a great deal unexplained. For example, “Relevant statis- 
tical theory was then employed to decide how large the sample from each stratum 
must be so that the amount of revenue due the Chesapeake and Ohio could be esti- 
mated with the required precision from as small a sample as possible” (p. 332). And 
again, “By appropriate statistical procedures, the results for each stratum were com- 
bined to form an estimate of the total amount of freight revenue due the Chesapeake 
and Ohio” (p. 333). Could not at least a footnote have indicated the relevant statisti- 
cal theory and the appropriate statistical procedures? 

The definition of “degrees of freedom” as being simply the denominator in the 
formula for the sample standard deviation seems a bit limited (p. 344). One is re- 
minded of the definition of a newspaper as “The thing in which the garbage is 
wrapped.” Similarly, the explanation of degrees of freedom within the discussion of 
linear regression and correlation leaves much to be desired (p. 435). 

The discussion of index numbers (Chapter XII) explains the weighting concept in 
great detail but does not in its initial presentation of computation of index numbers 
give the student any warning as to the possible inappropriateness of appl:ing a single 
set of weights over a period of years. It is true that the notion that weights selected 
as typical of one set of conditions may be atypical in another is eventually developed, 
but thirty pages after the chapter begins. The question is whether this warning comes 
early enough to be truly effective. 

The authors’ discussion of Time Series: Long Term Analysis (Chapter XIII) 
shows a realistic appreciation of the problems of industrial forecasting. 

The discussion of short-term forecasting, in Chapter XIV (Time Series: Short- 
Term Analysis), suggests that the technique of forecasting company sales by first fore- 
casting industry sales and applying the company’s expected share of market is a com- 
mon procedure. This is an interesting comment and the reviewer cannot help wonder- 
ing how stable the variable “share of market” remains for individual forecasters using 
this approach. 

The discussion of seasonal variation (Chapter XIV) seems only belatedly to dis- 
tinguish trend elements from seasonal elements. Thus, when an index of seasonal 
variation in revenue passenger miles flown by an airline is shown as 79.7 for January 
and 91.0 for December, there seems to be no consideration of the possibility that the 
December fignre is higher than January in part because of the pronounced upward 
secular trend in the whole time series. 

It is true that later on the authors talk about the removal of trend before calcula- 
tion of seasonal indexes, but—and this is a general criticism of the text—the pointing 
out of this concept is so far removed from the basic discussion of the problem that 
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the reader is apt not to appreciate the essentiality of the trend removal technique. 

Sometimes the choice of language in describing the seasonal concepts is unfor- 
tunate. For example (pp. 575-6), a study of department store sales shows that 
1,426,000 transactions occurred in the month of November, for which month the 
store had calculated a seasonal index of 178. The deflated (or, as the authors put it, 
“aeseasonalized”) number of transactions is, accordingly, 801,000. The authors then 
say. “If November business activity in the store continued for an entire year, the 
average monthly number of transactions during the complete twelve-month period 
would be 801,000.” What this must mean is certainly not that November business 
activity is assumed to continue for an entire year, but rather that activity in the rest 
of the year is assumed to be such as to make 1,426,000 (the observed November 
figure) exactly 178% of the monthly average. 

There is no question that the authors have developed a useful book. The preceding 
comments, undoubtedly excessively detailed, have been critical of a relatively small 
portion of the 652 pages of preface and text. What does seem to the reviewer insoluble 
is the problem of presenting essentially mathematical material in a “nonmathematical” 
way. 


Statistics for Management. B. J. Mandel. Baltimore, Maryland: Dangary Publishing Co., 
1956. Pp. xii, 408. $6.00. 


Jacos Mincer, City College of New York 


7 sense in which this new text constitutes an attempt to differentiate the product 
is better indicated by the subtitle “A Simplified Introduction to Statistics” than 
by the title “Statistics for Management.” Aside from its purely semantic accent on 
“management,” the book is a conventional text for an introductory one-semester 
course for business students. Its contents include: collection and presentation of data 
(Chapters 2, 3, 13), frequency distributions, averages, and dispersion (Chapters 4, 5, 
6), sampling and estimation (Chapters 7, 8, 9), correlation (Chapter 10), time series 
(Chapter 11), and index numbers (Chapter 12). 

According to the author, the text owes its existence to students’ “expressions of 
frustration in attempting home study in statistics with textbooks that took too much 
for granted and gave more subject matter than could be absorbed in an introductory 
study” (p. vii). The effort to simplify is commendable if the reduction of coverage 
and ease of exposition serve to focus on a few basic statistical ideas in an interesting, 
thought-provoking fashion. This objective, if it was sought at all, is not achieved in 
the present book. 

The restriction of subject matter to the topics, concepts, and measures which are 
presented is for the most part defensible. Certainly, little is lost by omitting discus- 
sions of an assortment of measures of kurtosis, skewness, or even dispersion, and of a 
variety of computational short-cuts, cook-book style. However, leaving out even the 
simplest notions about tests of hypotheses, two types of error, and sampling aspects 
of correlation reduces statistics to a purely descriptive technique. The beginning stu- 
dent cught to be aware that statistics is a much richer subject. 

The nemesis of the simplifier is that his efforts may easily produce obscurity or 
oversimplification. As a result one finds a number of statements which are likely to 
mislead. Thus the normal distribution is said to arise when “a large number of fac- 
tors, none of which are outstanding in their influence on the final values of the variable, 
combine to cause the values of the variable.” This situation obtains when “chance or 
nature’s forces cause the variations” (p. 82). Such expressions may be harmless, but 
the student is told that the above definition actually serves to predict the occurrence 
of normal distributions in a variety of phenomena. In particular, this predictability is 
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illustrated in the chapter on sampling by the normality of the sampling distribution 
of means. Presumably, this normality is to be expected since sample means are ob- 
tained by a chance selection process (p. 193). The student could not be blamed if he 
concluded that all sampling processes generate normal distributions. 

Incidentally, the numerical example given in connection with the normal sampling 
distribution of means involves samples of size 10. Fortunately, though this is not 
stated explicitly, the parent universe is nearly normal. But a few pages later the ex- 
plicit statement is made that “the distribution of sample means tends to form a nor- 
mal distribution regardless of the pattern of the universe” (p. 203). At the same time 
the sample standard deviation is computed by the formula >. d?/n (p. 214) though 
V>X#/(n—1) is preferable for so small a sample. 

In the discussion of sampling the impression is left that sampling procedures are de- 
signed to obtain a miniature replica of the population (see in particular pp. 151, 
169). 

Elsewhere, it is stated that standard deviations or coefficients of variation can be 
used to test the homogeneity of a population. Presumably a large dispersion indicates 
heterogeneity. The vagueness in such statements as to what could be, must be, and 

_ what constitutes a test is highly misleading. 

For a text slanted toward business applications the treatment of time series 1s 
weak. Emphasis on least-squares straight-line trends and on a method of seasonal 
adjustment which is based on the assumption that pluses and minuses of the cycle 
around the trend tend to balance out, is of doubtful value. Similarly, an index com- 
puted by simple averaging of original price quotations is better not mentioned. 

The use of taxonomy as an expository device is sometimes peculiar. Thus, “there 
are three methods of correlation analysis—logical, graphic, and statistical” (p. 269). 
Logical analysis, it turns out, is one based on “feel” or judgment. The same disturbing 
classification and terminology are used in the discussion of time series. 


Statistical Sampling for Auditors and Accountants. Lawrence L. Vance and John Neter. 
New York: John Wiley and Sons, Ine., 1956. Pp. x, 310. $9.00. 


Howarp L. Jones, Iilinois Bell Telephone Company 


I. making an audit of a firm or company of any size, public accountants do not make 
a detailed examination of every recorded transaction. They feel that making only 
a partial inspection of the records provides sufficient information to enable them to 
judge whether or not the financial statements based on those records are reasonably 
accurate. The actual procedure consists in “making spot checks,” which is equivalent 
to what a statistician would call “inspecting a judgment sample.” A few pioneers 
have experimented with scientific sampling to some extent. But only a beginning 
has been made in this direction. 

The purpose of Vance and Neter in writing their book is “to provide auditors and 
accountants, and students of those fields, with an introduction to some of the statitical 
sampling techniques that should be useful in auditing and accounting.” They hope to 
“encourage auditors and accountants to experiment with the application of statistical 
sampling techniques so that a broad body of practical experience in this field will 
be developed.” At the same time, they are cautious about “telling the reader ready- 
made ways in which statistical sampling techniques can be applied to any specific 
auditing or accounting problem.” They feel that “extensive experience in the applica- 
tion of statistical sampling techniques to auditing and accounting problems will be 
necessary before definitive solutions can be given.” 

The book is divided into four parts. Part I consists of a brief introduction, with 
chapters on the “Need for Statistical Sampling Techniques in Auditing” and “Some 
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Basic Sampling Concepts.” Part II, on “Statistical Decision Making,” is largely de- 
voted to the various types of acceptance plans commonly used ix statistical quality 
control. The underlying theory is discussed, and instructions are given for using the 
Department of Defense tables (Military Standard 105A), the Dodge-Romig tables 
for single and double sampling, and Cameron’s table for constructing single sampling 
plans. A brief chapter on control charts is included. Part III, on “Statistical Estima- 
tion,” covers the calculation of sample means, proportions, and ratio-estimates for 
simple random sampling and the measurement of their precision, the calculation of the 
sample mean and the precision for stratified sampling, and the formulas for optimum 
allocation. The last chapter in this part of the book is on case illustrations of sampling 
applications by statisticians in private industry. Part IV consists of a chapter setting 
forth ideas, based on the authors’ experience, on “Organization in the Auditor’s Office 
for Utilization of Statistical Sampling Techniques.” Each chapter is followed hy ex- 
tensive references, including some that are too mathematical for the average reader. 

The treatment of the subject matter is largely from the point of view of the external 
auditor. From this viewpoint, the choice of topics for presentation, and the relative 
amount of space devoted to each topic, seems very good on the whole. The use of 
illustrations from the auditing field will doubtless make the discussion more intelligi- 
ble to accountants than the usual presentation in statistical literature. 

The inclusion of both an introduction and a summary to some of the chapters seems 
rather superfluous. A rearrangement of the order of presenting the topics, with esti- 
mation discussed before acceptance sampling, might also have effected some improve- 
ment. It was noted, for example, that forward references to Chapter 7 appear on 
pages 12, 17, 37, 38, 40, 43, 54, 88, and 91. Rather than keeping the reader in sus- 
pense, the authors might have presented the material in this chapter, particularly 
the material on how to select a random sample, at an earlier place in the book. It 
seems altogether possible that an accountant might read the first six chapters without 
realizing that the procedures he is currently using are not the equivalent of random 
sampling. 

Recent applications of sampling to accounting problems in the Bell System and 
elsewhere suggest that estimation techniques may assume a more important role rela- 
tive to acceptance sampling than the one now assigned to it by the authors and other 
accountants. These techniques may become more valuable to auditors when they are 
better understood. It seems likely that most auditors are primarily interested in the 
question of whether the figures appearing on financial statements are reasonably ac- 
curate or not. They are also interested in the question of whether or not the bookkeep- 
ing operations are of acceptable quality; but this latter question is of interest to them 
chiefly for the bearing it may have on the answer to the first question. While accept- 
ance sampling is useful in finding the answer to the second question, estimation pro- 
cedures can help in making a direct attack on the question that is probably of primary 
interest. Theoretically, these techniques could even be applied to computing the sam- 
pling precision of a corrected figure as to the net worth (the difference between the sum 
of the assets and the sum of the liabilities), provided a corrected balance and its sam- 
pling precision have been computed for each asset and liability. 

The book appears to have few, if any, errors from the standpoint of statistical 
theory. One or two minor points may be worth mentioning, however. On pages 91 and 
92, the reader is cautioned that “the methods to be described are applicable if the 
correctness of any item is independent of the correctness of the other items, and if the 
basic conditions affecting the correctness of the items have remained the same over 
the period of time from which the sample is selected.” These qualifications are un- 
necessary with respect to the single and double sampling plans subsequently de- 
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scribed, provided the selection of the sample items is random. They are relevant, how- 
ever, if cluster sampling or systematic sampling is employed, but the sample data are 
analyzed as if they resulted from simple random sampling. The precaution on page 92 
that “if the auditor is sampling a finite population . . . the methods to be described are 
applicable only if the saraple size is relatively small . . . ” seems too sweeping. As is 
implied on page 69, the practical effect of a large sampling fraction is to make the risks 
smaller than would otherwise be the case, and this effect need not be disturbing to 
the auditor. On page 153, on the other hand, it would have been well to caution the 
reader that revised control limits are to be regarded as only tentative unless the work 
operation was actually in control during the period covered by the computations, as 
evidenced by the fact that all or substantially all the plotted points for that period 
were within the first trial control limits. Once control limits for a clerical operation 
have been determined for a period when the operation is in control, however, it seems 
best for psychological reasons to leave them unchanged until there has been a major 
change in the routines or conditions affecting the operation. 

The principal defect in the book is the poor English composition in many places. 

On pages 126 and 127, for example, we find the following: “In appraising a set of 
accounts, the auditor may find errors ranging from the insignificant to those requiring 
refusal of an over-all opinion. . . . It is imperative that a clear definition of error be 
arrived at in advance of the auditing if statistical acceptance sampling by attributes 
is to be successfully employed. In using the ‘per cent incorrect items’ as 4 basis for de- 
cision, certain fundamental choices confront the auditor, as follows: 

“(1) Errors involving minor amounts may be included with those involving serious 
amounts. 

“(2) Procedural errors may be included with errors in amounts (substantive 
errors). 

“(3) Any or all three of the error types may be treated separately. . . . 

“The foregoing list, it should be remembered, applies only if accounting transac- 
tions . . . are to be classified as correct or incorrect.” 

The passage just quoted gives rise to questions like these: What is an over-all 
opinion? What is an error requiring refusal of an over-all opinion? What is a funda- 
mental choice? What are the alternatives among which a choice is to be made? What 
is a serious amount? After a little reflection, an accountant can doubtless form a fairly 
good idea as to what the authors are trying to say. But many accountants are sticklers 
for good and lucid English, and they are likely to be irritated by reading poorly con- 
structed sentences. They may not be too pleased with pedantic expressions like “it 
is imperative” and “it should be remembered.” A little more care in phrasing the 
authors’ thoughts would have created a better impression of the worth of their 
efforts. 

Most accountants who read this book from cover to cover will learn a good deal 
about sampling theory and its application to accounting problems. There now ap- 
pears to be no other single volume where they can find nearly all the essentials on 
how to apply this theory in their own accounting work. 


A Primer of Social Statistics. Sanford M. Dornbusch and Calvin F. Schmid. New York: Mc- 
Graw-Hill Book Company, Inc., 1955. Pp. vii, 251. $4.75. 


R. R. Banapur, Indian Statistical Institute, Caleutta 


HIS book is a short introduction to the elements of modern statistics. It draws upon 
the social science field, particularly sociology, for its illustrative material and 
examples, but it is essentially an exposition of principles and methods. The book is 

addressed especially to readers with little knowledge of algebra. 
A little more than half the book is devoted to descriptive statistics, and the rest 
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to sampling and inference. Tables of squares, Fisher’s z transformation, the Normal 
distribution, and percentage points of the null r, ¢, and chi-square distributions are 
given in an appendix. 

The authors have made considerable efforts to orient the beginner and to sustain 
his interest. This reviewer thinks that the resulting treatment of descriptive topics is 
very successful and should prove stimulating and useful to a wide variety of readers. 
Topics treated wholly or in part in a descriptive manner include tabular and graphic 
presentation; measures of location and scale; ratios, proportions and rates; correlation 
and linear regression; and contingency tables. 

It is unfortunate that the careful selection of topics, and the generally careful and 
well organized writing, of the descriptive parts of the book are not maintained in the 
sections concerning sampling and inference. In fact, much of the material on sampling 
and inference is inadequate, or misleading, or erroneous. 

In the section on random sampling (p. 121), random samples are defined as “sam- 
ples in which every case in the population has an equal chance of inclusion in the 
sample.” In conformity with this incomplete definition, there is no description of how 
a random sample may be drawn, and no mention of the distinction between sampling 
with or without replacement. Incidentally, there is an apparent reference later on 
(p. 156) to the effect of this last distinction on the variances of sample proportions, 
but without explanation, in the following terms: “Statistical inference is concerned 
only with the size of the sample, not with the size of the population, as long as the 
population is more than five times the size of the sample.” 

The sections on sampling and sampling distributions in general (pp. 116-23), are 
followed by a chapter on statistical inference in general (pp. 124-31) that is full of 
statements likely to be unhelpful or confusing to beginners: e.g., “Statistical inference 
is concerned with making decisions” ; “In most social research we are willing to make a 
type I error 5 per cent of the time”; “If the probability of the null hypothesis being 
correct is less than .05 the null hypothesis is rejected.” The chapter concludes with 
certain relatively sophisticated and even controversial remarks on small samples that 
deserve more extended treatment but might well be omitted from a first course. 

The reviewer takes this opportunity to say also that the authors’ precept and prac- 
tice (cf., e.g., p. 126, p. 149 and p. 214) of always testing hypotheses at the 5 per cent 
level seem untenable, and contrary to the intelligent and critical use of statistics. 

Other flaws and errors that seem worth mentioning here are the foliowing: 

(a) Quite often a phrase such as “the probability of the statistic taking the value v” 
is used instead of the correct “the probability of the statistic taking a value as extreme 
as v or more extreme” (cf. pp. 119-20, p. 127 (middle), p. 129 (bottom), p. 152). 

(b) The probability of a standardized Normal variate falling beyond the limits 
+1.96 is sometimes stated to be less than .05 and at other times to be equal to .05. 
(p. 144). 

(c) The formula on p. 150 for the standard deviation of a difference between in- 
dependent proportions from large samples is in fact valid for small samples also. 

(d) The use of the phrase “95 per cent confidence” in the illustration on p. 155 
seems unnecessary and possibly misleading. 

(e) The calulation of the two sample ¢ test on pp. 165-166 is valid only in large 
samples. 

(f) The mean of Fisher’s z transformation of r is incorrectly stated to be zero (p. 
200 and p. 203). 

In conclusion, the reviewer emphasizes that the preceding criticism refers to only 
part of an otherwise excellent book, and also that most of the flaws noted by the 
reviewer (arising as they do from faulty writing or editing rather than from a faulty 
design) can be corrected easily by teachers using the book. 
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Proceedings of the Third Berkeley Symposium on Mathematical Statistics and Probabil- 
ity. Jerzy Neyman, Editor. Vol. I: Theory of Statistics; Vol. Il, Probability Theory. 
Berkeley and Los Angeles: University of California Press, 1956. Pp. viii, 208; viii, 246; 
$6.00; $6.50. 

isTING of the contents of Volumes I and II follows. For the contents of the other 
pe as see the Journal for December 1956, pp. 659-60. 

Volume I: Joseph Berkson, Estimation by Least Squares and by Maximum Likeli- 
hood; Z. W. Birnbaum, On a Use of the Mann-Whitney Statistics; Herman Chernoff 
and Herman Rubin, The Estimation of the Location of a Discontinuity in Density; 
Aryeh Dvoretzky, On Stochastic Approximation; Sylvain Ehrenfeld, Complete Class 
Theorems in Experimental Design; F. Elfving, Selection of Nonrepeatable Observations 
for Estimation; Ulf Grenander and Murray Rosenblatt, Some Problems in Estimating 
the Spectrum of a Time Series; J. L. Hodges, Jr. and E. L. Lehmann, Two Approzima- 
tions to the Robbins-Monro Process; Wassily Hoeffding, The Role of Assumptions in 
Statistical Decisions; Samuel Karlin, Decision theory for Pélya Type Distributions. 
Case of Two Actions, 1; L. Le Cam, On ‘he “ symptotic Theory of Estimation and Testing 
Hypotheses; Herbert Robbins, An Empirical Bayes Approach to Statistics; Murray 
Rosenblatt, Some Regression Problems in Time Series Analysis; Charles Stein, Effi- 
cient Nonparametric Testing and Estimation; Charles Stein, Inadmissibility of the 
Usual Estimator for the Mean of a Multivariate Normal Distribution; B. L. van der 
Waerden, The Computation of the X-Distribution. 

Volume II includes: David Blackwell, On a Class of Probability Spaces; S. Boch- 
ner, Stationarity, Boundedness, Almost Periodicity of Random-V alued Functions; K. L. 
Chung, Foundations of the Theory of Continuous Parameter Markov Chains; A. H. 
Copeland, Sr., Probabilities, Observations and Predictions; J. L. Doob, Probability 
Methods Applied to the First Boundary Value Problems; Robert M. Fortet, Random 
Distributions with an Application to Telephone Engineering; J. M. Hammersley, The 
Zeros of a Random Polynomial; T. E. Harris, The Existence of Stationary Measures 
for Certain Markov Processes; Kiyosi It6, Isotropic Random Current; Paul Lévy, A 
Special Problem of Brownian Motion, and a General Theory of Gaussian Random Func- 
tions; Michel Loéve, Ranking Limit Problem; Eugene Lukacs, Characterization of 
Populations by Properties of Suitable Statistics; Karl Menger, Random Variables from 
the Point of View of a General Theory of Variables; Edith Mourier, L-Random Elements 
and L*-Random Elements in Banach Spaces; R. Salem and A. Zygmund. A Note on 
Random Trigonometric Polynomials. 

D. D. F. 


How to Calculate Quickly. Henry Sticker. New York: Dover Publications, Inc., 1955. Pp. 
256, 5 X74. $1.00. Reprint. 


I. A. Stzcun, National Bureau of Standards 
HE major portion of the book is a drill book in mental arithmetic with a small sec- 
tion on short cuts. A table of prime and composite numbers from 1 to 625 is given 
as well as the answers to the problems. It aims to develop number sense and should 
prove to be a valuable practice book for students and ordinary laymen. 


Principles of Numerical Analysis. Alston S. Householder. New York: McGraw-Hill Book 
Company, Inc., 1953. Pp. x, 274. $6.00. 
D. Teicuroew, National Cash Register Company 


HIs book contains no material relating directly to statistics, except a brief outline 
of the Monte Carlo method and a short discussion of statistical assessment of error 
of computation. However, it should serve as a useful reference book for statisticians 
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who make use of digital computers and as a textbook for a course in numerical analysis. 

Householder states in the preface that “This is a mathematical text book rather 
than a compendium of computational rules.” The subjects which this text book covers 
are: the art of computation; numerical solutions of linear and non-linear systems of 
equations; interpolation and other methods of approximating functions; numerical 
integration and differentiation; and the Monte Carlo method. In addition, the book 
contains an extensive bibliography, the length of which testifies to the thoroughness 
with which the author has examined the subject. This book represents a pioneering 
effort in that it was the first to be written with high-speed digital computation in mind. 


Office Work and Automation. Howard S. Levin. New York: John Wiley and Sons, Inc., 
1956. Pp. xvi, 203. $4.50. 


Joun R. SpevuMan, Arthur Andersen & Co. 


rricE Work AND AUTOMATION provides a general background in some of the 
recent developments in data processing equipment and scientific management 
techniques. 

The first portion of the book covers the operation and use of “common language” 
data recording equipment and electronic equipment for data processing. This material 
is presented clearly and informatively, and anyone desiring a knowledge of how this 
equipment functions would find this a handy primer. Of greatest significance to the 
businessman, however, is the theme running through the book pointing out that 
this equipment opens the way to a type of office which would have as its objective the 
processing of “information” necessary for management decisions, rather than the 
historic role of the office as a place for recording only the financial history of the organ- 
ization. 

The last section is devoted to a discussion of operations research techniques and 
their place as 2 management tool. Included here is a discussion of the basic tools which 
the operations research man can bring to bear on business problems. These tools are 
demonstrated through the use of hypothetical illustrations, such as the use of linear 
programming techniques for the solution of the “transportation problem” and the use 
of sampling methods for quality control problems. 

Considering the nature and the type of information contained in this book it would 
be of primary interest to businessmen in the supervisory and management levels who 
are concerned with office operations and the development and use of data processing 
systems within their own organizations. 


Readings in Market Research: A Selection of Papers by British Authors. Frederick Ed- 
wards, Editor. London: The British Market Research Bureau Limited, 1956. Pp. xxxvi, 
235. $5.00. 


Lronarp Kent, Needham, Louis and Brorby, Inc. 


HE aim of this volume is to provide a representative selection of articles portraying 
samc developments in market research in Great Britain. To accomplish this 
objective, the three-man selection committee, presumably members of the Market 
Research Society (which was founded in 1947, and the counterpart of the American 
Marketing Association), has chosen 15 articles by British authors published since 
1945. In addition, a survey of the development of market research with an assessment 
of its present status in Great Britain is contained in an introductory statement. 

The statistical approach characterizes practically all the articles included. Within 
this framework the topics primarily dealt with are the methods and procedures of 
conducting ad hoc sample surveys—the collection of data at intervals, or perhaps only 
once, from different groups of people forming samples of the relevant population. 
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Such a restriction in coverage logically follows from the restrictive definition of mar- 
ket research given in the introductory chapter. Here the field of market research is 
defined as the collection and analysis of information the businessman wants about 
the market for his particular product in relation to the market for competitive prod- 
ucts. For such a purpose, specially designed sample surveys are admirably suited to 
meet particular needs of individual manufacturers. 

The articles fall into three categories: (1) technical problems involved in market 
research; (2) applications of techniques: (3) problems encountered by organizations 
specializing in certain types of surveys. 

The six papers devoted to technical aspects of market research deal with sampling 
methods pertaining to problems of sample design, mechanics of drawing the sample, 
interviewing, the selecting and training of interviewers, coding, and tabulation. 

Of the four papers under the heading of applications, only one gives a picture of the 
value and limitations of various types of market research from the viewpoint of a 
manufacturer. The others in this group are devoted primarily to areas of advertising 
research, such as retail audits, consumer panels, and the history and significance of 
readership studies for newspapers and magazines. 

The three papers dealing with organizations include the measuring of public opinion 
by the Gallup poll, determination of audience size and reactions to the radio and tele- 
vision programs of the BBC, and surveys undertaken by the Government Social 
Survey to aid in the formulation of policy decisions by government departments. The 
Social Survey is an organization established and maintained by the government, and 
is somewhat similar in its operation to our Census Bureau. 

The introductory chapter, dealing with the status of market research in Great 
Britain, will undoubtedly be of greatest interest to U. 8S. readers—particularly those 
engaged in marketing and advertising research. Although the U. S. is the traditional 
home of “scientific marketing,” it may come as a surprise to many that market re- 
search in Great Britain since 1945 has achieved a status almost equivalent to that 
in this country. Similar needs have provided the impetus for the development of 
market research in both countries. The widening gap between producer and ultimate 
consumer brought about by large-scale organization and the many brands competing 
for the consumer’s dollar, as well as the growth of advertising, are phenomena com- 
mon to both economies. Market research has been resorted to in order to reduce the 
area of uncertainty surrounding business decisions by improving the supply of market 
information available to businessmen. 

Although there are basic differences in the characteristics of the economies of the 
two countries, the structure of market research in Great Britain consists of the same 
types of organization as in the U. 8.: (1) private market and opinion research agencies 
which sell their services to other firms; (2) the market research departments of manu- 
facturing and distributing firms, advertising agencies, and public corporations. Mar- 
ket research in Britain is no less concerned with fact-finding than in the U. 8., and 
the lack of emphasis on the predictive value of the research results is a common fault 
of researchers in both countries. The problem areas, chiefly in the fields of product de- 
velopment, media research, and tests « f advertising effectiveness are largely the same. 
With one important exception, there is little difference in the methods and procedures 
used to explore these areas. The exception is that British market researchers, as yet, 
have not become too involved with the fashionable mistake of reading profound psy- 
chological problems into every facet of marketing activity. Such indulgence in the 
U. 8. is known as motivation research. 

The three papers of special interest to statisticians are: (1) The Register of Electors 
as a Sampling Frame; (2) The Proportion of Jurors as an Index of the Economic 
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Status of a District; and (3) The Methods of the Savings Survey, the final paper in the 
volume. 

The U. 8. statistician would find little that is new in the papers on sampling method 
and interviewing, particularly if he has read such volumes as Hansen, Hurwitz and 
Madow, Sample Survey Methods and Theory, Cochran, Sampling Techniques, and 
Hyman et. al., Interviewing in Social Research. 

This reviewer finds it difficult to justify the inclusion of Kendall’s article on “the 
Statistical Approach” in a volume entitled Readings in Market Research. This was 
Kendall’s inaugural lecture on his appointment as Professor of Statistics at the Uni- 
versity of London, and first appeared in Economica in May, 1950. It is a delightfully 
philosophical orientation to modern statistics, and should be required reading for 
anyone who is the least bit curious about statisticians and their work. 

The Register of Electors is available for the majority of organizations conducting 
sample surveys of the general population in Great Britain. The Register is compiled 
once a year on the qualifying date of November 20. It is published on March 15 fol- 
lowing, and remains in force without revisions until March 15 of the following year. 
The arrangement of the Register is particularly suited for applying random sampling 
methods to multi-stage sampling procedures. In urban areas the Register for each 
polling district is arranged in streets which are listed in alphabetical order. Within 
each street, house numbers are arranged in numerical order. In addition, the registered 
electors at each address are listed in alphabetical order. In rurai areas the electors in 
each polling district are arranged in alphabetical order of surnames. 

The Register suffers from two main defects: (1) failure of all qualified voters to 
register; and (2) lack of continuous revision to take account of deaths, newly qualified 
electors, and changes of address. Sample surveys have been conducted for the pur- 
pose of estimating these deficiencies, and to determine the biases likely to result from 
them. The conclusion is that the Register of Electors may be used as a satisfactory 
sampling frame for individuals and households, provided some practical procedure 
is used for dealing with the problem of movers. 

The Register of Electors also indicates the names of all persons liable for jury serv- 
ice in England and Wales. Since there is a property qualification for jurors, the per- 
centage of registered electors in any area has been found to be a reliable indicator of 
the economic status of the area. As a matter of fact, significant correlations have becn 
found between this J-Index, and such other indices as proportion in upper income 
group, industrialization index (the ratio of the taxable [rateable] value of all industrial 
property in the area to the total taxable value of the area), infant mortality rate, and 
income index. This J-Index has been found to be of particular use in stratifying subdi- 
visions of administrative districts, such as polling places and parishes, in order that 
they can be used as sampling units in multi-stage sample designs. 

The third paper of particular interest to statisticians provides a detailed description 
and analysis of the methods and procedures used in the planning and organization of 
the Oxford Income and Savings Survey. Its purpose was to provide estimates of the 
distribution of liquid assets among individuals, and of the volume of personal saving 
and its relation to personal income. This project is similar to, and used extensively 
the experience of, the Surveys of Consumer Finances in the U. 8., conducted for 
the Federal Reserve Board by the Survey Research Center of the University of Michi- 
gan. Response rates and the reliability of results are discussed in detail. The point is 
well made that sampling error is only one of several possible sources of error in a sam- 
pling survey, that serious errors are more frequently caused by faulty reporting and 
processing than by sampling fluctuations. Of particular interest is the finding that the 
standard error of a two-stage design is greater in the case of attributes than in the case 
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of variables. For example, the actual standard error of mean gross income from a two- 
stage design was only 5% greater than the simple random standard error. On the other 
hand, the proportion of income units owning a car had a standard error 40% greater 
than the sample random standard error. These findings indicate that in Great Britain 
there is much less variability between one area and another in average income than in 
the proportion of people who own cars. Perhaps careful analysis of similar statistical 
findings in the U. 8. might turn up similar differences in variability between attributes 
and variables among geographical areas. 

The U. S. reader engaged in advertising research will find very little that is new in 
the articles on media and audience research. These areas, judging from the examples 
here, are much further advanced in this country, particularly with reference to the 
types of problems that researchers attempt to solve. 

To sum up, those interested in the application of research to marketing problems in 
Great Britain will find the series of articles included in this volume rather disappoint- 
ing. There is too much emphasis on how sample surveys are conducted. This criticism 
is not meant to detract from the lucid style in which the material is written. There are 
no instances of serious error in methodology or in presentation. 


Consumer Expectations, 1953-1956. George Katona and Eva Mueller. Ann Arbor, Michi- 
gan: Survey Research Center, University of Michigan, 1956. Pp. 143. $3.50. 


Herman G. Roseman, University of Pennsylvania 


} pe monograph is primarily concerned with the uses in economic forecasting of sur- 
vey data on consumers’ attitudes and expectations. The main source of data is a 
privately financed series of attitude surveys, which is supplemented by data from the 
Federal Reserve Surveys of Consumer Finances. The attitudes and expectations 
examined are of three sorts: evaluations of general business conditions, evaluations of 
personal financial situations, and plans and attitudes regarding the purchase of con- 
sumer durables, especially automobiles. 

The authors have severely limited the scope of this “brief report.” They have not 
attempted to show the relations between attitudes and nonattitudinal characteristics, 
nor have they tried to use attitudinal and nonattitudinal data jointly as predictors. 
They make no rigorous tests of the hypothesis that consumers’ attitudes change auton- 
omously (i.e., independently of economic influences), or even of the hypothesis that 
attitudinal data yield better predictions than nonattitudinal data. They do not con- 
sider closely the relative reliability of the various attitudinal indicators. Thus, their 
monograph is limited, for the most part, to outlining the changes in consumers’ atti- 
tudes over the past few years, and making rough comparisons of these changes with 
changes in aggregate purchases of consumer durables. 

The authors further restrict themselves, in the main, to the problem of predicting 
the direction of changes rather than their magnitudes. They note, “It has been found 
during the past few years that predictions of consumer behavior derived from atti- 
tudinal and expectational data were more satisfactory when they concerned the di- 
rection than the magnitude of forthcoming developments.” (p. 9) But this restrictiou 
of the analysis limits greatly its potential usefulness and increases the difficulty of 
testing, since changes in the direction of economic movements occur relatively infre- 
quently. Furthermore, greater accuracy in predicting the direction of changes may not 
necessarily make this nonquantitative method preferable to a quantitative predictor 
which indicates which changes are likely to be the important ones. 

After the introductory chapter outlining scope, purpose, and method, there are two 
chapters relating changes in attitudes to general economic changes in 1953-55. It is 
held, on the basis of a general correspondence between the two, that attitudinal data 
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are of some use in predicting consumer behavior. This section is more historical than 
analytical. 

There follows a chapter on consumers’ plans to purchase automobiles. Using data 
both from the Federal Reserve and their “own” surveys, the authors make “predic- 
tions” of automobile sales for the period 1950-55. These “predictions” are made by 
simply multiplying the total number of American families (or spending units, depend- 
ing on the survey used) by the percentage of survey units expressing intentions to buy 
automobiles, and adjusting for government and business purchases (on the assump- 
tion that they account for fourteen per cent of total registrations). The prediction is 
then converted to index form. Separate indexes were constructed for intentions to buy 
new cars and intentions to buy new and used cars. The indexes were compared by the 
authors with new car registrations during the next twelve months. Although sizable 
errors appeared in some years, the general direction and rough magnitude of changes 
in automobile registrations were fairly accurately “forecast” by the survey data. It 
should be noted, however, that according to tables of sampling errors provided by the 
authors, changes of less than two percentage points in the proportion expressing inten- 
tions to purchase new cars are statistically insignificant at the .05 level (for sample 
size of about 2000). It would seem to follow that indicated changes of less than twenty 
per cent of automobile sales would be statistically insignificant! 

The fifth chapter, on housing demand, is mainly concerned with motivational an- 
alysis rather than prediction. Recent home-buyers were asked to give reasons for 
their purchases; others were quizzed as to whether or not they felt dissatisfied with 
or unsettled in their current dwellings, and why. Most of the replies which could be 
classified related to the desire for more living space, the desire to live in a better house 
or in a better neighborhood, and the desire to own a home for financial reasons. Al- 
though the relative importance of motives is difficult to determine from survey data, 
the authors are impressed by the large proportion expressing the desire for “upgrad- 
ing” of residence. They state, “Traditionally, residential construction has been viewed 
as a function of net family formation.” (p. 76) However, “there now exists replacement 
needs of upgrading needs of such magnitude that they represent an important addi- 
tional factor in the housing market.” (p. 80) How the authors know that the need for 
upgrading has become an “additional” factor is unclear, since they seem to have 
survey data for only 1950 and 1955, years for which the differences are small and for 
which, as they themselves admit, the surveys are not comparable. And of course tra- 
ditionally a number of factors other than net family formation—income, etc.—have 
been considered important determinants of housing demand. 

In the following chapter the authors tentatively present an “experimental” index 
of consumers’ attitudes. It is essentially a simple unweighted index of the percentage 
differences between “favorable” and “unfavorable” responses to each of eight attitu- 
dinal questions, including plans to buy. The index is compared with an index of aggre- 
gate sales of consumer durables as a percentage of disposable personal income and 
some agreement is found—which is essentially the conclusion of chapters two and 
three. While it might be considered premature to criticize the method of construction 
of this index, or its validity, it is surely legitimate to question the purposes of con- 
structing it. If the authors’ intention is merely to construct a summary measure indi- 
cating expected direction of change, it would seem simpler just to compare the num- 
ber of indicators showing significant improvement with the number showing signifi- 
cant deterioration. If their purpose is to construct an index to be used as a quantita- 
tive predictor, perhaps included in a forecasting model, then they should have tested 
in that way with their “experimental” indicator. 

The authors seem of the opinion that at times consumer behavior changes autono- 
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mously, i.e., independently of other economic changes. For this reason they seem to 
feel that attitudinal data must be used to supplement nonattitudinal data in forecast- 
ing. But even if we were to suppose that there is some degree of consumer autonomy, 
it would still have to be shown that this economically “perverse” behavior is persistent 
enough and of great enough quantitative importance to merit being taken into ac- 
count. It is at least conceivable, even if this “perverse” behavior exists, that if it is too 
volatile, prediction might be impaired through the use of ex-ante in addition to ex- 
post data. That is, it would have to be shown that the use of attitudinal data signifi- 
cantly improves (or makes less costly without reducing the accuracy of) the predic- 
tions gotten through the use of nonattitudinal data alone—and this the authors have 
not even attempted in the present volume. 


National Income Visualized. Arthur O. Dahlberg. New York: Columbia University Press, 
1956, Pp. x +117. $3.50. 


WituiaMm I. Greenwa.p, City College of New York 


HIS review seeks to do no more than suggest something of the effect produced by a 

careful reading of a book with one-third tables, one-third text, and one-third picto- 
graphs. The book is an elementary canter for the nonprofessional student, and is not 
meant to impress other economists. As far as I have been able to discover, the literary 
portion is impeccable in exposition. It is simple and direct, and obviously created out 
of thought, vitality, and enthusiasm. I assume the author accomplished what he set 
out to do. He is interested in a stock economic technique rather than economic sub- 
stance, expounded with a maximum of descriptive detail and a minimum of analysis. 
He ignores the idea that the most valuable product of a course in economics is a mini- 
mum of technical knowledge and tool-making and a maximum of analytic tool-using. 
However, he agrees with the principle that full and correct learning, initially, is eco- 
nomical learning. 

The book deals with the macro-economic aggregates of national income and the 
relations amongst the components. Since national income accounting, in all its vari- 
ants, is statistical in nature and involves flows of receipts and expenditures, visual 
symbols are used for illustration and explanation. Major segments of the economy are 
represented by buildings, exchanges between segments by pipes, magnitudes of trans- 
actions by pipe thickness (occasionally meters), and directions of movement by arrows 
and shadings from white to black. 

The author makes the claim of an original contribution in the area of pedagogics. 
Since the main idea in the book is one of approach, it is hard to compare with standard 
books in the same area. It is frankly experimental, but not sufficiently unorthodox to 
conceal the underlying rationale. The author believes that language is incomplete and 
verbalism inadequate, a modern variation of the ancient Chinese proverb that one 
picture equals ten thousand words. Pictographs are used to explain and illustrate the 
text and crystallize notions concerning national income aggregates. Diagrams are 
most successful when used to improve perception of concrete phenomena, such as 
economic groups and interlacings in accounts. The problem of improving concepts 
and analysis is not faced in the book. Visualization is not completely separate from 
verbalization, so that the symbolic representation is not a substitute for but a sup- 
plement to the textual diseussion. 

A textbook in national income accounting with a strong heuristic motivation is 
plainly a reasonable project. However, is the result pedagogically satisfactory and 
does it show important product differentiation? My feeling is that it is a good book of 
its kind—but I have nagging reservations about the kind. Admittedly such a reaction 
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must be personal and impressionistic in view of the absence of objective models and 
standards in the same field. 


National Income Statistics: Sources and Methods. Central Statistical Office. London: Her 
Majesty’s Stationery Office, 1956. Pp. viii, 387. 25s. 


Ropert R. Natuan, Washington, D. C. 


XK RECENTLY as a couple of decades ago, there was relatively limited use of national 
income statistics either for analytical or policy purposes. The widespread adop- 
tion of national income data for all kinds of purposes has been a rather recent phenom- 
enon. 

Prior to the publication of the first official estimates of national income in the 
United States in 1934, there had been varied pioneering efforts which served not only 
to develop and clarify the concepts, but also to point the way with respect to methods 
of estimate. However, it was not until the official figures began to appear on an annual 
basis that national income measurements achieved commonplace status among econ- 
omists and statisticians, as well as in the general realm of government, labor, and 
business. Today, national income statistics provide the most comprehensive and valu- 
able single category of data describing the functioning of our economy. 

As stated on the fly leaf of National Income Statistics: Sources and Methods, this is 
the first detailed explanation to be published of the concepts underlying the United 
Kingdom’s national income statistics and also of the manner in which the estimates 
are developed. Heretofore, some definitions and explanations have appeared in the 
annual Blue Book which contains the details of British national income estimates. 
The present volume will preclude the need for lengthy annual discussions of concepts 
and methods in the Blue Book. 

About one-fifth of National Income Statistics: Sources and Methods is devoted to 
explaining the concepts and the general usefulness of the annual estimates. The rest 
of the book is largely confined to detailed descriptions of the sources of material and 
the methods of utilizing the basic data in deriving the income estimates. Within each 
chapter dealing with the sources and methods, there is a brief and useful section en- 
titled “General Description.” These sections in essence supplement the more general 
comments contained in the early chapters. 

The volume will be extremely useful for many purposes. The degree of the detail 
dealing with sources and methods is certainly adequate to achieve the objective set 
forth in the book, namely to provide the user with sufficient information to judge the 
meaningfulness of the results and the reliability of the figures. For the technician in 
the field of national income, the book will be helpful in suggesting techniques and de- 
vices for making estimates where precise data are not readily available. Even though 
the source material in Great Britain is not always identical with that in the United 
States, the general similarity of methods of estimation is rather marked. This derives 
in part from the fact that over the years there has been excellent cooperation among 
national income estimators throughout the world. Yet thorough and detailed de- 
scriptions, such as appear in this volume, will certainly be valuable to technicians in 
many countries wkere national income estimating is still in a pioneering stage. 

The earlier chapters of the book will be far more useful to the nonnational income 
technicians than to the latter. It will not add to the knowledge of those who have 
followed the mass of technical literature that has become available over the past dec- 
ade or two. The conceptual discussion in National Income Statistics: Sources and Meth- 
ods is reasonably elementary. It was written for the layman as well as for the market- 
ing specialists, economic policy makers, business analysts, and general users of eco- 
nomic data. 
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It is comprehensive enough not only to explain the fundamental concepts, but also 
the general limitations as well as the usefulness and limitation on uses. Here and there 
the descriptions become somewhat technical, but by and large an excellent job has 
been done in translating the concepts into simple and meaningful terms for the non- 
technician. 

There are some deviations between the estimates for the United Kingdom and 
those developed in the United States. For example, in the United Kingdom statistics, 
all profits are regarded as income earned by the owners of capital. Consistent there- 
with, interest and dividends are treated as transfer incomes. In the United States, 
interest, dividends and undistributed profits are all listed separately and treated as 
different types of income flowing to or accruing to the owners of capital. The British 
estimates are presented only in gross terms, no estimates being included for capital 
consumption, which estimates must be deducted from the gross figures to arrive at 
measurements of net income. Description of differences between the British estimates 
and other estimates is handled with clarity in the book. 

Chapter Two, which is entitled “The Framework of Social Accounts” attempts 
with reasonable success to demonstrate how the national income statistics are organ- 
ized into social accounts, which reveal the operations and inter-relations of different 
segments in the economy and different aggregates of economic activity. This is a use- 
ful discussion, particularly for those who are concerned with the application of na- 
tional income statistics to intensive economic analysis. 

An interesting attempt has been made to provide the user with judgments as to 
the accuracy and reliability of the estimates. Obviously, a considerable degree of sub- 
jectivity is involved in such an appraisal, but it is certainly worthwhile to have the 
views of the estimator on this matter. It is not surprising to find that the margin of 
error is greater for income from self employment than labor income, or that the esti- 
mates of capital formation are substantially less acurate than many categories of 
income and expenditure. It is a bit disturbing to know that net income from abroad 
is classified as one of the least reliable components. One might well be curious as to 
how the estimators in the United States would classify the reliability of their esti- 
mates and how these evaluations would differ from those of the British technicians. 

It would hardly be feasible to attempt either to explain or to judge the way in 
which basic data have been used or the techniques or judgments underlying the spe- 
cific estimates in each sector and for each type of income. However, it can be said that 
anyone who works with statistics and is required to exercise ingenuity in the selection 
and application of data will profit substantially from a study of the specific descrip- 
tions in this volume. 

National Income Statistics: Sources and Methods should be of considerable value and 
help to all those who are intensive students of national income estimates and to all 
users of these highly important statistics. 


Personal Income During Business Cycles. Daniel Creamer (with a foreword by Geoffrey H. 
Moore). Princeton, N. J.: Princeton University Press, 1956. Pp. xliii, 166. $4.00. 


Francis M. Boppy, University of Minnesota 


HIs study was prepared by Daniel Creamer as a staff member of the National 
Bureau of Economic Research and follows the tradition of much of the work of 
the Bureau in being a careful and comprehensive summary of the numerical informa- 
tion available in the area and an analysis of the main characteristic patterns of rela- 
tionships among the major components and over time. 
Following a brief introductory chapter, Chapter 2, “Long Term Shifts in the Per- 
sonal Income Structure,” traces changes in personal income (before taxes) and in its 
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major components from 1913 to 1951 (the last year available at the time of the study). 
The declining percentage of property incomes, and of farm and nonfarm proprietor’s 
net incomes, with a corresponding rise in labor income’s percentage of the total are 
the main features of the story, with the special impact of World War I (and to a lesser 
extent, World War II) in getting these long run changes off to a fast start. 

In Chapters 3 and 4, Creamer takes up the timing of turning points and the ampli- 
tude of the cyclical variations in personal income and its main components. Here the 
monthly data on personal income, available only since 1929, and on business activity 
gave eight turning points to be studied. Personal income and all the major compo- 
nents except farm income show turning points that generally coincide with, or lag 
slightly behind, the turns of the general business cycle. Farm income cycles have a 
very low conformity with the general cycle, and also show the largest amplitude of 
cyclical variation of all the components. 

Chapters 5 and 6 give an analysis of the timing and amplitude of cyclical changes 
in labor and property incomes, by types. Nothing startling or unexpected turns up in 
the data, but the picture of what has happened to these components is made quite 
clear. 

In Chapter 7 Simon Kuznets’ work on the Shares of Upper Income Groups in In- 
come and Savings is the basis for the analysis of the cyclical changes in personal in- 
come by size of income (upper 1 per cent, 2nd to 7th per cent, and the lower 93 per 
cent). Here interesting and distinctive patterns appear of the relative shares of these 
three groups over time (1917-1948) and at the peaks and troughs of the cycles. The 
distinctive patterns in the “average” cycle are brought out and commented on es- 
pecially by Geoffrey Moore in his foreword. 

In Chapter 8 the impact of various government programs: agricultural, unemploy- 
ment compensation, and the federal personal income tax, are investigated to discover 
the extent and degree to which they acted as built-in stabilizers of income by acting 
in a counter-cyclical fashion. Chapter 9 is a summary of the findings of the study. 

The appendixes, running from page 115 to page 161, contain the basic data, with 
brief notes and comments on the sources of data used, and on the methods used in 
tying together and adjusting the data from various sources for purposes of the study. 

This is a very competent summary and analysis of the basic data on personal in- 
come in the United States and will be a most useful reference and source of data for 
all those interested in the field. 


Resources and Output Trends in the United States since 1870. Moses Abramovitz. New 
York: National Bureau of Economic Research. Occasional Paper 52, 1956. Pp. 23. $.50. 


Tueopore W. Scuuurz, University of Chicago 


I. 1s still widely held that economics and the other social studies are book disciplines; 
important contributions do not come in small packages; it always takes a treatise. 
This study by Abramovitz is a notable demonstration to the contrary: an important 
contribution to economics is reported in a few pages and others will build on the “find- 
ings” presented in this paper. 

The pioneering and basic work of Kuznets' in measuring income and wealth are 
now well known. Goldsmith’s* outstanding and massive stud: of savings in the United 
States is also at hand (three volumes and 2,246 pages!). The question now is: what 
insights can be gained about economic development from these basic statistical char- 


1 See especially “Long-Term Changes in National Income of the United States of America since 1870” in 
Income and Wealth of the United States edited by Simon Kuznets, Cambridge, England: Bowes and Bowes, 1952. 

2? Raymond W. Goldsmith, A Study of Savings in the United States, Princeton, New Jersey: Princeton Univer- 
sity Press. Volumes I and II, 1955, and Vol. IIT, 1956. 


282 AMERICAN STATISTICAL ASSOCIATION JOURNAL, JUNE 1957 


acterizations of income, wealth and savings? In the main this is the query to which 
Abramovitz has advanced himself. Schmookler’s* paper was a forerunner and that of 
Kendrick,‘ appearing at the same time as the paper under review, is complementary 
in analyzing changes in inputs and output in the United States. Fabricant anticipated 
Abramovitz’s findings in his thoughtful and speculative annual report of the NBER 
of three years ago. 

Abramovitz’s measures show that the net national product (output) of the econ- 
omy of the United States rose at a per annum rate of 3.5 per cent during the period 
from 1869-78 to 1944-53, while labor and capital together (the input) rose only 1.72 
per cent per annum, or at a rate which was a little less than one half that of output. 
Net national product in 1944-53 had reached an index of 1,325 with 1869-78 as 100; 
and capital and labor in an index of the input had risen to only 381.* The paper also 
discusses the long term movements in the trend and the short term fluctuations in 
output and input. Whereas two charts on “Trends in the Growth Rates, 1869-1953” 
are presented, the underlying basic data by decades unfortunately were not made a 
part of the paper. 

There is no doubt that it will be rewarding to examine and reexamine what the two 
sets of inputs represent and what the weights that were used stand for, to determine 
how satisfactory they are for particular analytical purposes. Abramovitz’s results, 
however, are clear: additional inputs of the conventional types account for only 
about one-half of the remarkable increase in output which the economy of the United 
States has been achieving. About half of the . crease is thus unexplained by such con- 
ventional inputs. Where does the large unexp'sined part come from? This question 
now becomes the all-important issue, so it woul. .ppear, in analyzing our kind of eco- 
nomic development. Meanwhile, there is the hope that Abramovitz’s results will give 
pause to those who contribute to that welter of loose thinking which rests on the belief 
that additional inputs of the conventional types are aecessarily the key to economic 
development. 


Trends in Employment in the Service Industries. George J. Stigler. Princeton, N. J.: 
Princeton University Press, 1956. Pp. xviii, 167. $3.75. 
Joun M. Pererson, University of Arkansas 

fer National Bureau of Economic Research has published a series of studies of em- 

ployment and output trends for manufacturing, mining, agriculture, public utili- 
ties and transportation in the United States. The remaining major industry sector, 
the service industries, has been partially surveyed in studies of government, trade, 
domestic service, and education. This final essay provides a summary and analysis of 
employment growth since 1870 for the service industries as a whole. 

The service industries have grown more rapidly than the rest of the economy and 
have increased their share of the total labor force from one fifth in 1870 to over two 
fifths in 1950. The share of the labor force also has expanded progressively in every 
major service industry group—trade, finance and real estate, education, other pro- 
fessional services and amusements, and government—with the persistent exception 
of domestic service. Thus, American experience generally conforms to Colin Clark’s 
thesis. However, the over-all trend has not kept a steady pace, and the patterne of 
growth have varied greatly among industry subcategories. 

* J, Schmookler, “The changing efficiency of the American economy, 1869-1938,” Review of Economics and 
Statistics, August, 1952. 

‘John W. Kendrick, Productivity Trends: Capital and Labor, New York: National Bureau of Economic Re- 
search, Occasional Paper 53, 1956. 

* Solomon Fabricant, “Economic Progress and Economic Change” a part of 34th Annual Report, National 
Bureau of Economic Research, New York, May, 1954. 


* If the armed forces are excluded from this measurement of the labor input the index becomes 361 instead of 
381. Labor in man-hours rose from 100 to 312 (to 290 if the armed forces are excluded) and capital to 993. 


Stigler seeks to find a rationale for these varied trends by a study of individual in- 
dustries. While he presents the available trend data for each major service industry, 
he limits the scope of his detailed analysis to only a few industries selected from several 
classifications according to industry characteristics. Thus, in one chapter he discusses 
retail trades, which serve consumers directly, while in another chapter on business 
services he separately analyzes wholesale trades. Among routine personal services he 
singles out for analysis domestic servants and barber and beauty shops. Among pro- 
fessional services he analyzes four traditional groups—-lawyers, doctors, college teach- 
ers, and military officers. 

As a result of his analysis of selected industries, Stigler finds no simple rules to ex- 
plain their diverse long-term trends, although he suggests that a common group of 
forces is operative in most of them. The most important are changes in tastes, which 
he associates with measurable changes in population structure and the degree of ur- 
banization, Also important have been such technological changes as increased worker 
skills, changing forms of business organization, specialization of firms, and the rise of 
mass communications. Also affecting employment at both ends of the occupational 
ladder have been such changes in the labor supply as restricted immigration, expan- 
sion of education, and the raising of professional training standards. 

Surprisingly, Stigler suggests as a hypothesis that the level of income has only a 
minor influence on secular consumption trends in service industries. He bases this 
hypothesis on the results of two types of empirical experiments. In his first group of 
experiments, he uses income elasticity coefficients obtained from consumer budget 
data (1919, 1935-36, and 1941) to predict: 1) employment changes (1920 to 1940) 
in seven service industries, 2) per capita consumption changes (1923-25 to 1946-48) 
for twenty foods, and 3) aggregate expenditure changes (1935-36 to 1948) in nine 
broad budget categories. The predicted changes did not correspond closely to the 
actual changes. In another group of experiments, he uses interstate comparisons to 
relate: 1) changes in per capita income to changes in trade employment (1920-40), 
2) changes in per capita income to changes in college enrollments (1940-50), and 3) 
average family income to servants per 100 families (1940). Again, the expected rela- 
tions did not oceur. ‘i‘hese failures of prediction based solely on income suggest the 
need for introducing other variables which are partially or wholly independent of 
income level, such as urbanization, educational level, and relative prices. Interestingly 
he finds inequality of income distribution closely related to the use of domestic serv- 
ants. 

Stigler does more in this study than provide a succinct summary of available trend 
data for a major industry sector: he also offers new insights concerning the forces in- 
fluencing these trends. It is unfortunate, however, that the scope of his study was 
limited and that his characteristic skill in economic and statistical analysis was not 
extended to the other service industries, which account for about half of the total em- 
ployment. A consequence, perhaps, is that the analysis is more exploratory and the 
findings more tentative than one might have expected in a final essay in the National 
Bureau’s series on this industry sector. Nevertheless, the book will be an indispensable 
reference and a stimulating source of research hypotheses for market analysts and 
students of economic development. 


Linear Aggregation of Economic Relations. H(ans) Theil. (Contributions to Economic 
Analysis, Vol. VII, J. Tinbergen, P. J. Verdoorn, H. J. Witteveen, eds.) Amsterdam: 
North-Holland Publishing Company, 1954. Pp. xi, 205. fl. 18.00. $5.00. 


Exseruarp M. Fets, University of California (Berkeley) 


A ITs inception—probably when Francis W. Dresch investigated implications of 
certain propositions of Frangois Divisia’s Economie Rationelle—aggregation 
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theory was closely akin to index-number theory and hence to statistical methodology. 
In a series of important contributions to the problem of aggregation, mainly in the 
mid-forties, this connection with statistics became strained and remote. Theil’s con- 
tribution is remarkable and unique in that it is written in hard statistical terms and 
avowedly prefers the practically pedestrian to the theoretically lofty. 

Theil distinguishes between and investigates in turn aggregation over individuals, 
over commodities, and over time periods. Naturally, most econometric models use 
multiple aggregates that include all three kinds. 

Right from the beginning, there are many strange and unsuspected phenomena. 
Consider, for instance, a set of linear and stochastic micro-equations that purport to 
show how each individual makes his sugar consumption depend on his personal in- 
come, the price of sugar, etc. It is customary to postulate a corresponding macro- 
relation showing how total sugar consumption depends on total personal income, a 
multiple of the average price of sugar, the size of the population, etc. The macro-vari- 
ables are simple sums or weighted averages of the micro-variables. One is likely to 
assume that the macro-parameters depend then only on the corresponding micro- 
parameters. Theil shows in his first theorem that this is not generall. true. That means 
the parameter describing the influence of total income on tota) sugar consumption 
generally depends also on the way in which individual families react to changes in the 
number of their members. Macro-reaction patterns may change completely while the 
micro-reaction patterns remain the same. It is only if all micro-parameters of the 
same kind have the same value—for instance, if al] families increase their sugar con- 
sumption by the same quantity when their income is increased by $100 per year— 
that noncorresponding micro-parameters have no influence on the macro-parameter 
in question. In other words: In this exceptional case, neither (a) the intercept of the 
macro-relation, nor (b) the parameter describing the effects of changes in the average 
sugar price, nor (c) the parameter describing the influence of population size depend 
on the ways in which individuals react to income changes. 

The dependence or lack of dependence of macro-parameters on noncorresponding 
micro-parameters is expressible in terms of covariances of weights and micro-param- 
eters. While it is now possible to say that the macro-intercept is equal to the sum of 
the micro-intercepts, plus a certain covariance correction, and a macro-slope is equal 
to the average value of the corresponding micro-slopes, plus a certain covariance cor- 
rection, these corrections may be so important as to bring about contradictions be- 
tween conclusions drawn from the micro-theory and those from the macro-theory. 

Theil devotes a large part of the book to careful, casuistic considerations of what 
kind of conditions micro-parameters or micro-variables have to obey in order to make 
the outlook for aggregation less than agnostic. There is accordingly a strong plea for 
complementing macro-studies by independent micro-studies, as it is a crucial fact 
that different statistical methods yield estimates of different macro-parameters be- 
cause the weights implied by the use of different methods vary from one method to 
another. 

Theil’s chapters on aggregation over several sets of individuals or commodities and 
over time periods follow similar lines. The conclusions, however, are not altogether 
analogous. Theil sees particular difficulties with the latter variety of aggregation pro- 
cedures whenever one faces time series in a rapidly expanding economy. There are 
also particular problems when the equations contain lagged endogenous variables. 

A special chapter is devoted to situations in which one of the explanatory variables 
of a macro-equation depends on the random disturbance of this equation—a chapter 
which is in a way a comment on the tenth monograph of the Cowles Commission and 
asserts, for instance, that identifiability of the micro-system does not imply and is not 
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implied by identifiability of the macro-system. Again, the micro-system may contra- 
dict the often-oceurring overidentifying prescriptions of the macro-system. 

After dealing with the properties of various kinds of “actual” aggregation and us- 
ing the results obtained, Theil investigates desirability criteria for “perfect” and 
“good” aggregation, namely how to keep the “aggregation bias” as small as possible. 
Theil sees no justification for taking this bias less seriously than other statistical bias 
phenomena. (Since Theil’s publication, empirical results concerning input-output 
matrices and businessmen’s expectations have shown curious aggregation effects.) 
Above all, downright contradictions between estimated micro- and macro-structures 
should be avoided. A suitable type of fixed-weights aggregation is one answer to the 
problem. Other remedies are too involved to be verbally rendered here, but perfection 
can always be obtained if a set of linear micro-equations is to be aggregated to a single 
linear macro-equation. For systems of simultaneous linear macro-equations the situa- 
tion is much less clear. 

Although clearly written and well produced, this is a difficult and frequently exas- 
perating book, but the fault for that probably lies in the nature of things, not in the 
author. There is a labyrinth of subscripts and superscripts, but the notation is con- 
sistent throughout. Most proofs or sketches of proofs are collected in the final chapter, 
which is only slight help. Theil has to be commended for patiently seeking and formu- 
lating positive statements in a constructive manner when it would have been easy to 
indulge in all-out negativisin. 

It is strange that Theil’s should be the only book on aggregation so far when the 
topic is so important. Not only is this kind of thing a solid bridge between economet- 
rics and economics proper, but the choice of an aggregation procedure is in a sense the 
choice of an economic theory. The degree of aggregation is the degree of abstraction. 
To have shown that this choice stands and falls with statistical considerations is to 
have shed a new bright light on the ancillary role of statistics for economics. 


Minimum Price Fixing in the Bituminous Coal Industry. Waldo E. Fisher and Charles M. 
James. Princeton, N. J.: Princeton University Press, 1955. Pp. xxxl, 523. $10.00. 


Jesse W. MarxuaM, Princeton University 


7 coal industry’s fortunes are on the upswing. Production in 1956 is expected to 
exceed a half-billion tons; productivity and wages are at an all-time high; work 
stoppages have been negligible for several years; prices are stable; and mine operators 
are making profits. A. H. Raskin of the New York Times recently reported that “coal 
and its miners are striking it rich.”! He predicted they would strike it richer. 

This book is a detailed chronicle of the bituminous coal industry in less fortunate 
days. Its authors have demonstrated an amazing capacity for exploring and exploit- 
ing voluminous source materials. The result is an excellent documentation of the 
events precipitating the enactment of the Bituminous Coal Conservation Act of 1935, 
the grounds on which the Supreme Court struck it down (its labor provisions and 
price-fixing provisions were constitutionally invalid), the subsequent events leading 
to the enactment of the Bituminous Coal Act of 1937, and the 6-year history of mini- 
mum price fixing in the coal industry under the 1937 act. 

As the authors point out, “the establishment of some hundreds of thousands of in- 
dividual mine prices was a gigantic task.” The Coal Commission greatly simplified the 
problem by resorting to averages. The coal industry was divided into 22 districts. 
The weighted average production costs, including normal profits, were calculated for 
each district and were coordinated to determine minimum prices for minimum price 
areas. After appropriate hearings the commission put the prices into effect. These 


1 Sunday, Oct. 14, 1956, p. E9. 


. 


286 AMERICAN STATISTICAL ASSOCIATION JOURNAL, JUNE 1957 


steps alone required most of the time the law was on the statute books: the law be- 
came effective on April 26, 1937 and expired on August 23, 1943; the first coordinated 
minimum price schedules did not go into effect until October 1, 1940. But even then 
the task had barely begun. Enforcement, in spite of rather severe punitive tax meas- 
ures, was no easy task. During the first 15 months of enforcing minimum prices 1,840 
violations were filed with the Bituminous Coal Division (the successor agency to the 
commission) ; it is certain that many evasions went unreported. 

Analysis of the coal price-fixing experience confronts insuperable obstacles: a 4-year 
period separated enactment of the 1937 Act and the conditions in the coal industry 
which precipitated it; the Act itself became effective over 3 years after its enactment 
but only two years before the coal demands of the second World War made it obsolete 
and irrelevant; meantime, it is not clear whether the course of the coal industry was 
influenced more by provisions of the Act or by their many violations and evasions. 
These difficulties may explain why, of the total space, an economic appraisal of this 
signal venture into public price fixing receives a scant 25 per cent, much of which is 
used to summarize the preceding 306 pages of detailed descriptive material and to 
present previous appraisals by others, notably Eugene V. Rostow,? Walton Hamilton,’ 
and Donald H. Wallace.‘ However, after analyzing and correlating various time series 
they conclude that public price fixing (1) added to rather than decreased excess (and 
inefficient) capacity; (2) hastened mechanization; (3) reduced work stoppages from 
labor disputes; (4) probably encouraged the substitution of other fuels for coal; (5) 
increased profits, and (6) raised labor standards. The beneficial effects, however, are 
illusory and short-lived. In the long run price fixing, by unbalancing supply and de- 
mand, aggravates the basic causes of instability. The authors conclude with the ob- 
servation that Congress’ stated long-run objective of industrial stability is not attain- 
able through compulsory cartelization and public price fixing. The time series corre- 
lations which produced this important policy conclusion are of questionable validity; 
the conclusion itself, however, is supported convincingly by the logic of price theory 
and one can only hope that Congress in its future deliberations accords it the serious 
attention it deserves. 

The broad synthesis of their study with public policy toward industry generally 
the authors leave to others. Why did Congress authorize price fixing and, in effect, 
a basing point system of pricing for coal at precisely the same time the antitrust agen- 
cies were prosecuting both pricing practices with renewed vigor? What economic and 
political forces lead Congress selectively to abrogate our traditional public policy of 
maintaining competition? What similarities are there between coal, agriculture, labor, 
and retail trade (Miller-Tydings) in this respect? The authors do not consider ques- 
tions such as these. On the narrower issue of price fixing as a solution to problems 
confronting the coal industry in the thirties the book can be recommended as a com- 
plete and competent performance. 


The Pattern of Financial Asset Ownership: Wisconsin Individuals, 1949. Thomas R. 
Atkinson. National Bureau of Economic Research, Financial Research Program. Prince- 
ton, N. J.: Princeton University Press, 1956. Pp. xviii, 176. $3.75. 


E. 8. Saaw, Stanford University 


HE burgeoning interest in determinants of the demand for financial assets, and in 
their implications for monetary and other financial policy, needs the empirical 
sustenance of more studies such as this. Atkinson, working under the joint sponsorship 


? “Bituminous coal and the public interest,” The Yale Law Journal, Yol. 50, No. 4, Feb. 1941, pp. 543-94. 
* “Coal and the economy—a demurrer,” The Yale Law Journal, Vol. 50, No. 4, Feb. 1941, pp. 595-612. 
* Economic Standard of Government Price Control, TNEC Monograph No. 32, Part IV. 
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of the National Bureau of Economic Research and the University of Wisconsin, has 
sampled individuals’ income-tax returns to the State of Wisconsin in 1949 and con- 
verted reported incomes from financial assets into inter-personal distributions of 
financial-asset holdings. His objective has been to probe factors that explain individual 
differences in preference among three classes of financial asset—time deposits and 
allied claims, debt instruments such as bonds and mortgages, and corporate stocks. 
The factors that he finds to be significant are differences in income and wealth among 
investors, in occupation, and in size of city of residence. 

Working up the income scale, Atkinson shows that preferences shift away from 
time deposits and similar assets, from mortgages and personal promissory notes, from 
listed and traded stocks of more modest yield toward corporate bonds, traded stocks 
with higher yields, and untraded stocks. Higher incomes are associated with a relative 
emphasis on equities, rather than on debt-type assets, and on equities of such com- 
paratively high yield that the damping effect of progressive income taxation on the 
flow of equity funds is open to doubt. Occupational classification of Wisconsin in- 
vestors reveals, among other things, that untraded stocks are favored by the “mana- 
gerial and self-employed,” traded stocks by professional employees, time deposits and 
similar assets by farmers and laborers. As for location of residence, equities appear to 
have been favored in urban areas, debt-type assets in the countryside. These conclu- 
sions and others are stated with due caution, and the reader may check most of them 
himself from the study’s excellent tabular appendix. 

Data on financial-asset preferences and holdings are difficult to come by. The survey 
technique has been criticized because blow-ups of samples miss by so very wide a 
margin the aggregates that can be objectively measured. Samples taken from records 
of debtors are not easily and reliably distributed among creditor classifications. Atkin- 
son has put his questions to neither creditor nor debtor, but has gone to a third source, 
income-tax returns in a state which requires reporting of specific sources of interest 
and dividends and capital gains. The author is very frank regarding the distinctive 
shortcomings of this alternative. His careful comparisons of data assembled by his 
technique and its alternatives demonstrate how insecure is our knowledge of financial 
behavior. 

Atkinson has applied the simplest of statistical methods to his problem, and has 
provided an admirably explicit set of laboratory notes on his procedures. He makes 
no attempt to wash out the effect of sampling error on his conclusions, all of which are 
stated in terms of no greater elegance than medians, frequencies, and bar charts. 
The reader may, as a result, wish to attach less importance to Atkinson’s conclusions 
about the relevance of his findings to economic policy affecting the flow of savings 
into investment than to his trials and tribulations in working over an unconventional 
source of financial information. ’ 


The Core of the City: A Pilot Study of Changing Land Uses in Central Business Districts, 
A publication of the Institute for Urban Land Use and Housing Studies. John Rannells. 
New York: Columbia University Press, 1956. Pp. xxix, 233. $5.50. 


Reena R. Isaacs, Harvard University 

i Is interesting to find that the author is an architect, Ordinarily, architects are 

not involved in inquiry and methodology of a statistical nature. However, John 
Rannells is not an ordinary architect. His curiosity regarding the problems of the 
“core” or central business district is one that currently is shared by many architects, 
architect-planners, and city plannerg Where the architects attempt to come to quick 
physical solutions, Rannells concerns himself with the causal factors and relationships 
of structures and activities and with developing a basic and multiple approach of a 
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graphic-statistical nature. Yet, an orderly architectural quality is evident in the 
structuring of his method; even a cursory view of his table of contents would show 
this. The book is unusually well organized in respect to the sequence in which sub- 
jects are developed. The table of contents itself is an outstanding model for writers 
in its succinct annotation and summaries. There are many illustrations; not all of 
these are easy to comprehend; in part, this is the fault of variations in scale or lack of 
indication of scale or keying, of drafting techniques, and of the printing process itself. 

An explanation of the book’s title, Core of the City, seems necessary since it is an 
expression which is being worked overtime these days to describe many situations. 
The title refers to a “core” as a single center; the text refers to “central district,” “civic 
center,” “heart of the city,” “hard core,” and “urban core.” However, unlike an apple 
a city may have one or more “cores,” balancing, imitating, rivaling, or competing 
with each other. These “cores” are not necessarily only business districts, but may be 
cultural, recreational, educational, or other groupings. Actually, Rannells refers only 
to “ .. . central district (which encompasses a somewhat broader area than the popu- 
lar “downtown” conception, (and) which is generally limited to the most concentrated 
area of commercial structures . . . .” Rannells had in mind Philadelphia’s concentrated, 
rectangular-shaped two-square-mile central business area. His use of this was for 
advantage of propinquity, and the availability of the Philadelphia Central District 
Study by the City Planning Commission and other data. Certainly, his method 
appears to relate well to the many problems of this particular, but atypical, area. 

What is the method? Rannells’ hypothesis (according to the jacket) is that “the 
value of the city’s physical assets is not inherent in the actual structures so much as 
in the use that is made of them.” According to the same source, he has developed “a 
theory that shows how the land-based activities and the structures in a city are inter- 
related,” and has introduced a “unique graphic-statistical method of measuring and 
comparing groups of activities in the central business district.” He has made quan- 
titative descriptions and comparisons of the activities that occur over the entire 
Philadelphia central area. The basic data for these methods consists of block-by-block 
listings of establishments classified as to kind of business. His methodology includes 
studies of: (1) distribution of establishments—all kinds—from which he has plotted 
distribution density diagrams: distributions are compared as to their degree of con- 
centration; (2) distribution of floor space: centers of gravity are determined by a 
system of coordinates; radii of dispersion are calculated and graphed; contours are 
also mapped to indicate concentrations and lower intensities of use; (3) distribution 
based on (1) and (2) combined; (4) distribution based on major business groupings, 
their relationships, proximity, relative size, and other comparisons; (5) linkage of 
persons and goods in terms of accessibility for the former and handling of the latter; 
and (6) “constellations” of activities and their inter-relationships. Numerous maps, 
charts, tables, diagrams, distribution curves, centers of gravity and density profiles, 
radii of dispersion, and comparisons are employed. 

How successful is Rannells in this effort? Application of the methodology to a 
single point of time does not reveal directions or amounts of change. Best use of the 
methods would require re-application periodically to determine trends. The relation- 
ships between public and quasi-public facilities and their use are not a part of the 
method; yet, this reviewer believes these must be taken into consideration in any 
study of central areas. The method seems cumbersome for use in the ordinary city 
planning office. Indeed, some of the descriptive language would be unfamiliar to 
planning technicians who would apply the method. This reviewer is not convinced of 
the universal applicability of the methods. 
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Rannells’ forecast, on the basis of his study, is that “the coming pattern of ur- 
banization appears to be one dominated by a network of strong concentrations tied 
together by transportation lines, not the general suburban sprawl that was formerly 
predicted. The freedom to locate at will (brought about by motor transport) is still 
strongly governed by efficiencies due to nucleation of activities.” However, he notes 
the need to relate the study of the city center to the entire city in terms of land-use. 
Before a “Theory of Locational Arrangement” can be determined, two studies addi- 
tional to The Core of tie City and Urban Traffic (Mitchell and Rapkin) are required; 
these would be on the subject of “linkages” and “nucleations.” Such a four-pait 
study would examine relationships of establishments to activity systems and to cen- 
ters for the central district as a whole and subcenters. Until this is done, Rannells’ 
present earnest study is necessarily exploratory and partial in its usefulness. 
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REPORT OF THE BOARD OF DIRECTORS, 1956 


It is pleasant to report that, as a result of sharp economizing and excellent cooperation 
from committees and individuals, the surplus for 1956 will be substantial. Detailed figures 
are presented in the 1956 report of the Secretary-Treasurer. The addition to surplus not 
only mskes up the loss incurred in 1955, but insures continued expansion of the Associa- 
tion’s activities. It is the continuing policy of the Board and Council to budget for an 
eventual surplus equal to about one year’s income. 


Publications 


Morris Hamburg, of the Wharton School, University of Pennsylvania, became the new 
editor of The American Statistician early this year. He succeeds Almarin Phillips who had 
been editor for three years, 1953-55. Under Mr. Phillips’ hand, a number of changes were 
made in policy and format, which increased the value and interest of this publication. 

The “Proceedings of the Business and Economic Statistics Section’? was authorized 
by the Board for the two years 1955-56. This volume includes many papers given at the 
Section’s sessions at the Annual Meetings of the Association in New York in 1955, and 
in Detroit in 1956. The price to members is lower than to non-members ‘$3.25 versus 
$3.75) and, in addition, the 1954 Proceedings may be purchased at half price when or- 
dered at the same time. While the sales of the 1954 Proceedings did not cover the cost of 
publication, it is expected that the 1955-56 volume will come closer to the “break even” 
point. As the Proceedings become more widely known and publicized and continuity is 
established, they are expected to become self-supporting. 

“Improving the Quality of Statistical Surveys,” a collection of papers desing primarily 
with the Federal Statistical Program and dedicated to the memory of Sam Weiss, was 
published this year in collaboration with the Washington Statistical Society. These origi- 


nal papers were presented at a special meeting of the Society. The price of the booklet 
has been set merely to recover printing costs. 


Ford Foundation Grant 


In response to a request from the Association for financial assistance in preparing an 
index to the Journal, which would be of interest to scientists in many disciplines, the Ford 
Foundation has granted a sum of $19,640. The new index will cover volumes 35-50, that 
is 1940-1955. The money received from the Ford Foundation will not cover printing costs 
—these will be borne by the Association. Preparation of the new index will probably re- 
quire about 18 months. 


Weiss Memorial Fund 


A committee, chaired by Lester Frankel, was appointed this year to study the best 
way to establish a memorial to Samuel Weiss. The committee recommended an assistance 
fund on the graduate level, to be placed at the University of Pennsylvania (Mr. Weiss’ 
Alma Mater) and administered by the Wharton School. After details had been worked 
out with the University, a letter of solicitation was sent to all members of the Association, 
and the request was repeated at the time of the balloting for officers. Donations received 
are being forwarded to the University of Pennsylvania. A further report on the sum col- 
lected will be made next year, after the major portion of the contributions is in. 
Redistricting 

In 1955 a proposal was made to redistrict the Association on a geographical basis (here- 
tofore the districts were determined by chapter group). A committee was set up, headed 
by John Tukey, to study the whole problem and make recommendations to the Board. 
As a result of the committee’s final suggestions, the Council veted to redistrict the Asso- 
ciation into ten areas, covering the United States and Canada (and including U. 8. Terri- 
tories). Since each district has two representatives on the Council, this increased the size 
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of that body by eight members. The new plan is on trial for a two-year period, after which 
it will be re-examined by the Council. The redistricting provides better representation and 
insures that members not in chapter areas will also have a vote for representatives from 
their district. The listing of the new districts, and a map showing boundaries, was sent 
to all members in November 1956, and will be run once a year in The American Statistician. 


Other Activities 


Agreement signed with ICA—A three-year contract has been concluded with the Inter- 
national Cooperation Administration whereby that agency will sponsor for membership 
in ASA foreign trainees who have returned to their native lands. The purpose is to give 
them an opportunity to keep in touch with current developments in statistics in various 
fields. ICA will pay two-thirds of the dues, and the applicant will pay one-third. The dues’ 
rate is at the reduced cost for persone outside North America. 

New Pittsburgh Chapter—The Bord has granted a charter to the newest chapter of 
the Association in Pittsburgh. The number of chapters is now 35. The Los Angeles Chap- 
ter was granted permission to change its name to the Southern California Chapter. 

Resolution for Howard Whipple Green-—At the Annual Meeting in Detroit in Septem- 
ber 1956, the Association honored Howard Whipple Green’s 25 years as Chairman of the 
Committee on Census Enumeration Areas with a resolution lettered om a handsome scroll. 
The citation was presented by Robert W. Burgess, Director of the Bureau of the Census. 

One innovation was introduced into the yearly mailings in November, which included 
the ballots and dues notices. A form was enclosed for suggestions from the membership 
for nominees for officers in the Association. The form carries spaces for suggestions for the 
offices of President-Elect, Vice-President, Director and Representative-at-Large. All 
forms received by March 1, 1957 will be turned over to the 1957 Committee on Nomina- 
tions when it begins its deliberations. The suggestions sent in by the membership will be 
of great assistance to the committee in selecting outstanding persons to serve as officers 
of the Association. 

Probably the major problem facing the Association is that encountered by most other 
similar groups during recent years. As knowledge becomes more compartmentalized and 
concentrated there is a tendency for splinter groups to form to take care of these spe- 
cialized needs. This is especially true during years of prosperity and expansion when the 
costs of secretariats can be borne more easily or when these costs can be absorbed by 
private or semi-private groups. From the long-range point of view especially, much 
thought should be given to this problem by ASA, and it may be necessary to reassert 
more positively, but in new and varied ways, its historic role as the meeting ground for 
many disciplines, each of which may profit from developments among the others. 


Annual Meeting Plans 


Future Annual Meeting sites are as follows: 
Atlantic City September 10-13 (joint with IMS) 
Chicago Christmas Week (joint with AEA and others) 
Washington, D.C. Christmas Week (joint with AEA and others) 
(open) September (possibly West Coast) 
New York City Christmas Week (joint with AEA and others) 
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It is gratifying to report that the expected surplus of income over expense for the 1956 
calendar year will be the largest ever enjoyed by the Association. As pointed out in the 
Report of the Board of Directors, this increment to surplus more than makes up the loss 
suffered at the end of 1955. The addition places the cumulative total at about $37,700. 

The estimated surplus for 1956 is approximately $13,000. This has been a result 
of a reduction in publication costs (primarily the Journal), economizing all along the line 
in expenses, and substantial increases in income over that budgeted. 1956 income was 
budgeted at $63,410. Estimated actual for the full year is approximately $68,500. 
The items most noticeably up over budget figures are membership dues and subscriptions, 
although increases were noted in other items as well. 

Your Secretary-Treasurer cannot help feeling, however, that the saving in the Journal 
costs, aside from the printing short-cuts and efficiencies that have been worked out, is 
not a good sign. The two major points in the Association for professional development and 
creativeness are the annual meetings and the Society’s publications. The annual meetings 
have become self-sustaining. A year ago the Journal’s budget was badly in the red. This 
year it operated approximately $8,000 below budget. This budget has been raised, but the 
volume of articles submitted to the Editor has fallen off, perhaps temporarily. Causes for 
this have been debated in the Sections and the Publications Committee will help us to 
point up the problem. 

Expenses for 1956 were budgeted at $64,165. Actually, the expected total was ap- 
proximately $55,500. As mentioned abov+, the reduction in cost of the 1956 Journal vol- 
ume is the most significant saving; however, there are other economies. 

It would be appropriate to mention the splendid results obtained by the Committee on 
Institutional Membership, chaired by Virginia Holran. Eleven organizations became In- 
stitutional Members in 1956, bringing the total to a new high of 38. This helped in insur- 
ing that membership income would exceed budget. 

At the end of 1956, the Association has 5,691 members. These fall into the following 
classes: 


Regular and Student........ 5,535 Senior Members..........:. 39 
Family Members............ 52 Honorary Members......... 4 
Life Members..........+... 61 


Note that this count does not include organizations holding Institutional Membership, 
but does include employees who are appointed by the organization as Institutional Rep- 
resentatives, These individuals are counted as regular members, with exactly the same 
privileges. 

Of the 5,691 members, 121 reside in Canada, and 414 in other countries throughout 
the world. 

At the end of 1955 the Association had 5,536 members. During the year, 162 persons 
resigned or died, and at the end of 1956 311 more were dropped for non-payment of dues. 
New members totaled 607 and 21 others reinstated their membership, bringing the 1956 
year-end total to 5,691. The increase in membership was not as much as had been hoped, 
but more intensive promotion in 1957 will bring a greater increase for that year. 

Promotion is also planned for subscriptions to the Journal. The number of Journal 
subscriptions for 1956 reached a new high of 1,657, an increase of 175 over 1955. 

More intensive work is planned also for advertising solicitation in the Association’s pe- 
riodicals. The rates were raised for 1957 to bring them more nearly into line with the 
greater circulation of the Journal and The American Statistician. 

The expected accumulated surplus of $37,000 at the end of 1956 brings the Association 
well on its way to the goal of accumulating an amount equal to about one year’s income. 
With this sum as a “cushion,” certain projects can be undertaken at a loss (such as the 
printing costs, not borne by the Ford grant, of the index to Volumes 35-50 of the Journal 
when it is completed, or the publication of an enlarged new Membership Directory). Other 
projects will receive the attention of the Board and Council during the ensuing year. 

May 10, 1957 
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To the Board of Directors of 
the American Statistica) Association: 

I have examined the accompanying financial statements of the American Statistical 
Association relating to the year ended December 31, 1956. My examination was made in 
accordance with generally accepted auditing standards and, accordingly, included such 
tests of the accounting records and other auditing procedures as were considered necessary 
in the circumstances. 

The recorded cash receipts for the year were traced to the deposits shown on the bank 
statements, and the amounts for dues and subscriptions were tested against the member- 
ship and subscription records. The paid checks were inspected and related vouchers tested 
in support of cash disbursements for the year. The bank balances were reconciled with 
certificates obtained directly from the depositaries, and the cash on hand was counted 
and reconciled with the books during the course of the examination. I did not check the 
membership and subscription records in detail or make any independent verificatior of 
the inventory of back numbers of journals, the office records of which are based, in part, 
on data assembled in prior years. 

In accordance with a resolution passed by the Board of Directors, the expense incurred 
in publishing a directory, distribution to the membership beginning in 1954, was spread 
cv: @ three-year period although such costs would appear to be applicable primarily 
to the year 1954. The accounts for the year ended December 31, 1956, reflect a charge of 
$1,787.10, representing the remaining balance of the directory expense applicable to that 
period. 

In my opinion, the accompanying statements present fairly the financial position of the 
American Statistical Association on December 31, 1956, and the results of its operations 
for the year then ended, in accordance with generally accepted accounting principles, 
except as mentioned in the previous paragraph, applied on a basis consistent with that 
of the preceding year. 

James G. JESTER 
Certified Public Accountant 
Tower Building, Washington 5, D. C. 


(Exhibits and Schedule follow on pages 296-299) 
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Exhibit A 
AMERICAN STATISTICAL ASSOCIATION 
ComPaARATIVE BALANCE SHEET 
Assets 
December 31 
1956 1966 
Cash in banks and on hand—Schedule I................... $77,042.72 $55,584.20 
Investments: 
United States Savings bond, Series G, due 1962, at cost... . 3,100.00 3,100.60 
United States Treasury 24% bonds, $7,000 face amount, 
Inventories: 
Old Journals, at approximate 2,922.00 2,707.50 
Monograph on Acceptance Sampling, at cost jade ws 7.94 
Monograph on Kinsey Report, at cost.................. 2,848.87 3,039.04 
Furniture and Fixtures, at cost, less accumulated depreciation 1,660.42 1,666.38 
Deferred charges: 
Deferred Membership Directory expense....... 1,787.10 
Annual meeting expense... .... 382.00 
Prepaid insurance and fidelity bond.................... 202 .02 358.96 
Liabilities and Net Worth 
Deferred income (collections applicable to 
subsequent years): 
19,640.00 
Net worth: 
Total Liabilities and Net Worth.....................005. $96,716.66 $73,290.90 
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AMERICAN STATISTICAL ASSOCIATION 
CoMPARATIVE STATEMENT OF INCOME AND SURPLUS 


Year ended December 31, 


Income: 


Dues-——-Old Members, including life members 
—New Members 


—Kinsey Report Monograph 

—Acceptance Sampling Monograph 
—Business and Economic Section Proceedings 
—Emblems, less cost of sales 

—Biometrics 


Mailing list income 
Interest income 
Annua) meeting 
Miscellaneous income 


Total income 


Publications—Schedule I 
Promotion 


Telephone and telegraph 
Accounting services 

Committee expense 

Annual meeting expense 
Miscellaneous expenses—Schedule I 


$74,507.79 


Excess of income over (expenses) for the year $(5, 550.99) 
Add: Surplus account at beginning of year 30 , 360.81 


Surplus account at end of year—Exhibit A $24,809.82 
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Exhibit B 
1956 1955 
700-25 
4,683.00 3,799.25 
12,441.53 11,170.30 
—American 504.45 468 .20 
—American Statistician 699.76 535.00 
602.00 2,571.57 
80.50 
456 .69 2,143.72 
62.67 62.58 
Expenses: 
Travel and secretarial 732.55 735 .30 
887.45 919.81 
252.45 
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AMERICAN STATISTICAL ASSOCIATION 


Schedule I 


$18 503.37 

6 ,652 .67 

Monographs—Kinsey Report 190.17 
—Acceptance Sampling 

Business and Economic Section Proceedings 40.18 

Membership Directory, allocated expense 1,787.10 


Total publications—Exhibit B $27,173.49 
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: 
Year ended December 31, 
1966 1955 
Publications: 
Journal—Printing......... $15,078.02 $25,559.08 
—Editorial 2,493 .26 2,363.28 
—Cost of old Journals. ... 10.00 42.50 
$28 , 903.09 
6,665.85 
‘ 1,978.85 
37.55 
: 4,046.99 
1,787.11 
419.44 
Miscellaneous Expenses: 
Dues to other organizations. 122.25 123.25 
Repaite and maintenance. 115.71 155.90 
Total miscellaneous expenses—Exhibit B................ $ 1,790.00 $ 1,923.73 7 
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Cash on hand and in banks: 


American Security and Trust Co.— 
Operating checking: 
Special checking 
Bank of Nova Scotia 
Office petty cash fund 
American Building Association, Washington, D. C 
City Federal Savings & Loan Association, Milwaukee, Wis- 


Guardian Savings & Loan Association, Chicago 
Hyattsville Building Association, Hyattsville, Maryland.. 
Interstate Building Association, Washington, D. C 
Jefferson Federal Savings & Loan Association, Washington, 


Liberty Building & Loan Association, Washington, D. C.. 
Liberty Savings & Loan Association, Los Angeles, California 
National Permanent Building Association, Washington, 
5,437.53 5,252.10 


9,113.32 


$77,042.72 $55,584.20 
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December 31 
1956 1955 
| $19,816.61 $19,095.80 
3.49 6.49 
293.53 273.19 
20.00 20 .00 
5,390.53 5,206.71 
: Fidelity Federal Savings & Loan Association, Glendale, Cal- 
5,015.63 
1,092.57 1,054.27 
8,140.48 10,265.85 
Piedmont Federai Savings & Loan Association, Manassas, 
. Total cash in banks and on hand—Exhibit A............ po 


BIOMETRIKA 


Vol. 4, Parts 1 & 2 
CONTENTS 


PEARSON, E. 8S. John Wishart, 1898-1956. Obituary and Bibliography 
ARMITAGE, P. Restricted sequential procedures 
BARTLETT, M. 8. On theoretical models for competitive and predatory biological 


PATIL, V. T. The consistency and adequacy of the Poisson-Markoff model for density 
fluctuations 


HANNAN, E. J. Testing for serial correlation in least squares regression 
— ee H. M. On the analysis of multiple regression in k 
ca 


BROWN, RB. L. Bivariate structural relation 
WHLIOOM,,d. W. An analysis of paired comparison designs with incomplete repeti- 
ons 


MALLOWS, C. L. Non-null ranking models. I. 

AITCHISON, J. and SILVEY, 8. D. The generalization of probit analysis to the case 
of m ultiple responses 

PEARCE, 8. C. Experimenting with organisms as blocks 

COX, D. R. The use of a concomitant variable in selecting an experimental design 

BULMER, M. G. Approximate confidence limits for components of variance 

BARTON, D. E. and DAVID, F. N. Multiple runs 

LINDLEY, D. V. Binomial sampling schemes and the concept of information 

LINDLEY, D. V. A statistical paradox 

HASKEY, H. W. Stochastic cross-infection between two otherwise isolated grcups 


MACKENZIE, J. K. and THOMSON, M. J. Some statistics associated with the random 
disorientation of cubes 


DARWIN, J. H. The difference between consecutive members of a series of random 


HARLEY, B. I. Relation between the distributions of non-central ¢ and of a trans- 
formed correlation coefficient 
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Do you belong in IBM Applied Science? 
This gensuuns Job is now open! 


Recently promoted, this man formerly 
worked as an IBM Applied Science Repre- 
sentative out of the Boston office. Using 
electronic digital computers, he developed 
methods for solving commercial and scien- 
tific problems in mathematical language. 
Working with many New England indus- 
tries and institutions, he found a new chal- 
lenge every week. He outlined the methods 
for designing a giant transformer; for 
shielding a nuclear reactor; for investigat- 
ing the mathematical aspects of sales fore- 
casting. He also assisted colleges and uni- 
versities—teaching computer applications. 


Could you fill it? 


A previous knowledge of computing is 
not necessary. IBM will pay for your 
training. 

Throughout the United States, IBM 
maintains approximately 100 Applied 
Science offices. You may request assign- 
ment in the location that is most desir- 
able to you. 

IBM Applied Science has quadrupled 
its staff during the past three years. In 
1956, over 70 promotions were con- 
ferred. Doesn Doesn’t this growth factor alone 
suggest more room for your abilities— 
more professional growth? 

Why not act today? Please write to: 

Mr. W. E. Leavitt 


ELECTRIC TYPEWRITERS 
MILITARY PRODUCTS 
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TIME EQUIPMENT 
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© Advise customers and prospecis of 
the scientific and technical applica- 
tions of IBM electronic equipment. 


® Analyze customers’ technical prob- 
lems in terms of machines and their 
applications, 

® Deliver talks about the computing 
field—supported by demonstrations— 
to customers, prospects, scientific 
groups, and IBM personnel. 

© Maintain constant and close contact 
with the customers’ top management 
and associated IBM executives. 

© Continually analyze customer appli- 
cations and develop new machine 
uses. 


© Arrange for training of customers’ 
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country. Its future is unlimited. If you want to join 

a group of pioneers in this exciting field, we invite 

you to investigate the openings on our staff. 

Cn our part we offer: 

1. A record of experience in operations research, — 
out-distanced by perhaps no other organization, 

2. A scrupulously maintained professional ap- 
proach and atmosphere. 

3. The team approach to problem solving. On each 
team are representatives of varied disciplines — 
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4. pom th equipped digital and analog computing 
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Maryland, one of Amierica’s most attractive 
suburbs. Pleasant homes and apartments in all 
price ranges are available. Schools are excellent. 
Downtown Washington, [), C. with its many 
cultural and recreational advantages is but a 
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6. Favorably competitive salaries and benefits, ex- 
tensive educational programs, unexcelled leave 
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Dr. L. F. Hanson 


OPERATIONS RESEARCH OFFICE 


The Johns Hopkins University 


7100 CONNECTICUT AVENVE 
CHEVY CHASE 15, MARYLAND 


Please mention the Journal of the Amenican Statisticat Association in writing advertisers 


‘ 
. 


BIOMETRICS 
Journal of the Biometric Society 


Vol. 13, No. 1 CONTENTS March 1957 


Incomplete Block Designs with Row Balance and Recovery of Inter-Block 
Information (W. B. Taylor) ; Extension of Multiple Range Tests to Group Cor- 
related Adjusted Means (Clyde Young Kramer); An Application of the Kron- 
ecker Product of Matrices in Multiple Regression (E. A. Cornish) ; Tables for 
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van der Veen); Queries and Notes: Interpretation of x* Tests (P. Armitage 
and M. J. R. Healy) ; Missing Plot Estimates (H. Fairfield Smith). 


Biometrics is published quarterly. Its objects are to describe and exemplify 
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Association may subscribe through the Association at the rate of $4 yearly. The 
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» Are medical costs adequately 
covered by health insurance plans ? 


Voluntary Health 


Insurance in Two Cities 
A SURVEY OF SUBSCRIBER HOUSEHOLDS 


By ODIN W. ANDERSON, and the Staff of the 
National Opinion Research Center 


o< it is generally assumed that expenses for hospital care 

and surgery are the medical costs which are most difficult 
for families to meet, current insurance plans are directed toward 
covering these expenses. What percentage of family medical bills 
actually arises through hospitalization, however? And what per- 
centage arises through home and office calls, obstetrical services, 
dental care and costs of drugs and medicine? Mr. Anderson’s 
study reveals that these miscellaneous services, which are not 
covered by voluntary health insurance, are a major problem for 
some families. 


"T's important book is the first to be based on a scientific 

sampling of subscriber households in specific local areas— 
Birmingham, Alabama, and Boston, Massachusetts. By studying 
health insurance plans in the light of consumer needs, Mr. Ander- 
son sets a pattern for any future evaluation of insurance benefits, 
and brings many new facts to bear on a problem of vital interest. 


a=: who are associated with Blue Cross and Blue Shield plans, 
embers of insurance companies, schools of public health, 

I sae and medicine, and persons in Health, Education and 
Welfare departments will find this detailed survey both interest- 
$5.00 


ing and enlightening. 
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Barriers to New Competition B, Heflebower 
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Review of Books, Titles of New Books, Periodicals, Notes 


The AMERICAN ECONOMIC REVIEW, a quarterly, . 4 the official publication of the 
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tabulating facilities that utilize all the 
latest equipment for speed, accuracy 
and economy, we believe that our 
responsibility to research directors 
does not start with these electro- 
mechanical services. 

Long experience in working with 
publishers, agencies and research de- 
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that the quality of the finished reports 
depends in large measure on the 
thinking and planning that go into 
the project from its inception. 

That’s why STATISTICAL makes pro- 


STATISTICAL 


TABULATING CORPORATION 


MARKETING RESEARCH 


fessional help available to you, right 
from the start. In resolving your 
ideas. In translating sound thinking 
into all-inclusive questionnaires. And 
then backing up this all-important 
preliminary assistance with respon- 
sible help in the mechanical and 
analytical phases of your project. 

Just a few minutes with one of our 
what this complete service can mean 
to you in producing the quality 
analyses and interpretations required 
in today’s competitive markets. 


Just phone our nearest office. 


CHICAGO 

53 West Jackson, HArrison 7-4500 
NEW YORK 
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TABULATING - 
TEMPORARY OFFICE PERSONNEL 


CALCULATING + TYPING 


NEWARK 
National-Newark Bidg., M Arket 3-7636 


CLEVELAND 
1375 Euclid Ave., SUperior 1-8101 


Please mention the Journal of the Amentcan Sratisticat Association in writing advertisers 


The Economic Status of the Aged 
By Peter O. AND Rosert DorFMANn 
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Retirement Policies Under Social Security 
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of the social, and financial factors considered by 
in establishing and modifying the social security program. 
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Spending of Middle-Income Families 


By Emmy H. Huntincton 


A Heller Committee study centered on white-collar families in the San 
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Paperbound $1.45 
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w. Short-cut methods for linear simultaneous 
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diagrams. 400pp. 5% x 8. Paperbound 
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NUMERICAL METHODS, F. A. Willers. Hun- 
dreds of shortcuts for numerical solutions. Spe- 
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Paperbound $2.00 
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x 8, Paperbound $1. 
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MATHEMATICAL RECREATIONS, M, Krait- 
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CRYPTANALYSIS, H. Gaines. Standard ele- 
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Dept. 109, Dover Publications, 920 Broadway, New York 10, N.Y. 
(Be sure to mention department number) 


Please mention the Journal of the Amenican Statistica Association in writing advertisers 


: 
__ 


STATISTICIANS 


We need statisticians with 
advanced degrees and ex- 
perience in statistical design 
of experiments for industrial 
processes, formulation of 
sampling plans, and the 
establishment of quality 


querque, famous for its dry, 
mild climate. We offer espe- 
cially good working condi- 
tions and generous benefits, 
in one of the country’s most 
interesting areas. 


Write Staff Employment Division 


control programs in elec- 
tronics or small mechanical 
devices. 


Sandia Corporation is loca- 
ted in metropolitan Albu- 


SAN DIA 


CORPORATION 


A Complete Introduction to... 


PRINCIPLES OF STATISTICAL ANALYSIS 


SAMUEL B. RICHMOND, Columbia University 


gust Published! A new textbook designed as an intro- 
duction to statistica! analysis for students of business and eco- 
nomics. Detailed illustrative material combines with the text to 
present a thorough treatment of the collection, analysis, and pre- 
sentation of statistical data. The book is organized around the 
modern concept of statistical induction. Mathematical procedures 
are kept to a minimum, and those techniques employed are intro- 
duced and explained at the point of use. A unique feature is the 
Glossary of Equations in which each equation in the text is listed, 
located, and explained. 210 ills., tables; 491 pp. $6.50 


Handbook of Graphic Presentation 


CALVIN F. SCHMID, University of Washington 


Detailed working guide for all con- each basic type, including 3-dimen- 
cerned with the presentation and in- sionals, showing its advant and 
terpretation of statistical data with disadvanta or different data. 

charts and graphs. Book analyzes 210 ills., tables; 31 316 pp. $6.00 


“THE RONALD PRESS COMPANY ° 
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Electronic Countermeasures 
To Outwit the Missiles 
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in San Francisco Bay Area 
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Continued expansion in systems work has created 
openings at all levels for: 

Mathematicians 

Statisticians 

Physicists 

Engineers 


IN RESEARCH & DEVELOPMENT 


in electronic countermeasures against guided missiles. 


e Analysis 

Advanced Design 
¢ Simulation 
© Operations Research 


EDUCATIONAL OPPORTUNITIES 


We offer financial assistance to those who wish to 
take additional course work at any of the four univer- 
sities and colleges located in our immediate vicinity. 


@ 
Address inquiries to: 


Alfred E. Halteman 
Systems and Projects Dept. 


SYLVANIA ELECTR! PRODUCTS ING. 


P.O. Box 205 
Mountain View, California 
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ECONOMETRICA 
Journal of the Econometric Society 


Vol. 25, No. 3, July, 1957 
CONTENTS 
ZVI GRILICHES: Hybrid Corn: An Exploration in of Change 
F. E, A. BRIGGS: On Problems of Estimation in Leontief Mod sane 
: Butter Demand and Inventories 
R. STONE AND D. A. ROWE: The Market Demand f= Ponti Goods 
ARNOLD ZELLNER : The Short-Run Consumption Fu 
‘ D. SARGAN: The Distribution of Wealth 
0. A. WOLD AND P. WHITTLE: A Model Exp way ad Law of Wealth Distribution, 
W. McKENZIE: An Anal; of the 
REPORT OF THE AIX-EN-PROVEN 
REPORT OF THE CLEVELAND MEET 


BOOK REVIEWS 


eng région Lorraine (Pierre Bauchet). Review by Otto 


ot Vols. I and (André Marchal). Review by Hugo 


egeland 

Les a et le concret (Maurice Fréchet). Review jy ee J. Savage. 

eT ay Parameterteori ( Theory or Parameter Theory) (Arne Rasmussen), Review 
ans Brems. 

Economic Growth and Instability (D. Hamberg). Review by Rohe Solow. 

Mercantilism (Eli F. Heckscher). Review by Joseph J. 

The Evolution of Wage Structure (Lloyd G. Reynolds an Cynthia H. Taft). Review by Harold W. 


Davey. 
Ui y in E ics and ~ Reflections (G. L. S. Shackle). Review by W. 
Cahiers du Séminaire . Les modéles econo Review 
toine. 
rung in die Wirtschaftstheorie (E. Schneider). Review by Jiirg Niehans. 


(and others) 


STATISTICIAN 


Detroit research laboratories has opening in statistical planning 
group. Requires person with M.S. or Ph.D. degree in statistics and 
2 to 5 years industrial experience in engineering, chemical or physi- 
cal applications. Principal duties involve the planning and analysis 
of experimental work of a diverse nature encountered in our auto- 
motive and chemical research laboratories. The data are often 
characterized by high variability and high cost. A small digital 
computer is available to the group. For more particulars write to: 

Personnel Manager 

Ethyl Corporation Research Laboratories 


1600 W. Eight Mile Road 
Ferndale 20, Detroit, Michigan 
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UNITED 


STATISTICAL 
YEARBOOK 
1956 


A comprehensive collection of authori- 
tative statistics from 151 countries and 
territories, covering a wide range of 
economic and social subjects. This 
volume includes for the first time ex- 
tensive official statistics for the 
U.S.S.R. and East European countries. 
Most of the tables cover a twenty-year 
mesa ending 1955, and some are 
— up to mid-1956. Subjects in- 
clude: 


Population 
Agriculture 
Fishing 
Mining 


Transport 
Internal and external trade 
WwW 


© Balance of payments 
© Social Statistics 
646 pp. Published. 
Cloth $7.50, paper $6.00 


NATIONS 


DEMOGRAPHIC 


YEARBOOK 
1956 


This eighth issue of the Yearbook con- 
tains summary and detailed statistics 
from censuses of populations carried 
out between 1945 and 1955 in sovereign 
countries, dependencies, and territories. 
Population statistics of special interest 
include: distributions by religion, lan- 
guage, country of birth, ethnic group 
(race), statistics on level of education 
and school attendance, cross-classifica- 
tions of the economic characteristics— 
industry, occupation, and status, An 
analytical chapter on “The World 
Demographic Situation” supplements 
the regularly published technical notes 
which describe the limitations in com- 
parability of the statistics in each table. 
Also included are tables on: 
e Area and density 

Population trends 

Population composition 

Live births 


STATISTICS OF NATIONAL 
INCOME AND EXPENDITURE 


This volume contains the latest available data for eighty-one countries and 
territories. Each country chapter contains a brief introductory text (Source 
and General Notes) and the statistical tables proper. In this tenth issue of this 
series are estimates for each of the following subjects for the years 1950-1955: 
¢ Expenditure on gross national product ¢ Industrial origin of gross domestic 
product ¢ Distribution of the national income ¢ The finance of gross domestic 
capital formation ¢ Composition of gross domestic fixed capital formation by 
type of capital good, by industrial use and by of purchaser ¢ Receipts and 
expenditures of households and private non-profit institutions ¢ Composition of 
private consumption expenditure ¢ General government revenue and expendi- 
ture ¢ External transactions. 280 pp. Published. Paper $3. 00 


International Documents Service 
COLUMBIA UNIVERSITY PRESS 


New York 27, New York 
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e Expectation of life 
750 pp. Published. 
Cloth $8.50, paper $7.00 


_ INTRODUCTION to MODERN STATISTICS 
With Application to Business and Economics 


by Werner Z. Hirsch, Associate Professor of Economics, 
Washington University, and Chief Economic Research 
and Assistant Research Director of the Metropolitan 
St. Louis Survey 

A new introductory treatment of statistics for students of 
business administration and economics—based upon the 
probability approach—this text presupposes no mathematical 
knowledge beyond high school algebra. It introduces the 
reader to modern inference methods and gives him a sound 
understanding of the tools and ideas of statistics, particularly 
as they apply to decision-making. Comprehensive and stimu- 
lating, the text offers numerous interesting, classroom-tested 
exercises and includes such modern topics as managerial and 
quality control, electronic computers, and sampling (and at 
the same time gives emphasis to index numbers and time 
series analysis). Ready Summer 1957 


EXPERIMENTAL DESIGN 

Theory and Application 
by Walter T. Federer, Professor of Biological Statistics 
in charge of the Biometrics Unit, Department of Plant 
Breeding, New York State College of Agriculture, Cor- 
nell University 

“Although an unduly large number of books on statistical 
methods have been published recently, the number devoted 
strictly to the subject of experimental designs has remained 
relatively small, Hence there was a need tor a book of this 
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